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INTRODUCTION 



\ 

^ts^_^issertation constitutes an attempt to provide explanations 
for the syntactic phenomena currently known as "Ross* "constraints." 

\Vhile there have been a number of attempts aimed at providing 
more adequate (in terms of generality, simplicity, or some such .| 
criterion) reformulations of the phenomena- in question, I know of no 
proposal which seriously addresses itself to the task of explaining: 
them. To be more exact, most .discussions of Ross* constraints have 
regarded them as formal syntactic universals, and have assumed that 
an, adequate statement of them would constitute their ultimate 
explanation. 

The position adopted in' this dissertation is that a mere, state- 
ment of the constraints at issue ^ even if otservationally and/or 
descriptively adequate (which most treatments with which- I ain 
familiar are not, as will be shown below), should not be regarded as 
an explanation of the facts. The point of view expressed in the - 
preceding sentence applies with equal force to the position that 
Ross* constraints constrain transformations (henceforth: the 
Transformational Position ) —taken , for example, in Chomsky (196Ua, 
1964b i and 1971), and Ross (1967, and an unpublished proposal known 
as the Island Constraint )— and to the position that the constraints 
in question are global restrictions on derivations (henceforth: the 
Derivational Position )~as taken, ^ for example, in Lakoff (1969), 
Postal (1969), and Ross (l969b). The reason why I have serious 
doubts that a mere transformational or derivational statement can 
-constitute the explanation -of the constraints in question is that it ^ 
is unlikely, given the present state of the field, that any version 
of transformational grammar known at present could be an adequate 
account of the competence of language users. The large number of 
counterexamples which have so far plagued any attempt to provide a 
reasonably elegant account of significant bodies of data, as well 
as the results of recent mathematical investigations of the properties 
of transformational grammars (e.g., Peters and Ritchie, I968), strongly 
•suggest that* a transformational grammar is not a realistic model of 
linguistic competence. If so, it is not in the least clear what, 
if anything, corresponds^to the notion "transformation" in psychological 
terms, and claims to the effect that formal properties of these doubtful 
entities — the transformations — constitute explanations of observable 
facts must be taken with a considerable pinch of salt. 

Given the questionable explanatoiy status of purely formal 
universal constraints, there remain two classes of Tacts vAich may 
provide explanations for syntactic data: semantic and behavioral ones. 
Such facts are on considerably surer grounds, for, while we may doubt 
the existence of transformations, there are no reasons for doubting the 
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existence of concepts, or of perception and production mechanisms. 

. fu*^ respect to Ross'i constraints?, there are clear indications 
that they have very little (if anything) to do with semantics (in 
fact, Ross himself repeatedly emphasized that his constraints 
concerned derived, rather than un derlying , structure). Thus, consider 
the fact tnat sentences vith sentential subjects are s'ubject to one 
of Ross constraints, while the transformational congeners of such 
sentences in which the sentential suoject is extraposed are free 
from the pertinent constraint, even though the two sentence-tyPes 
have Identical underlying representations; consider also the fkct 
that in an analysis which regards relative clauses as derived from 
conjuncts, the clause containing'the relative is free from- constraints 
while the conjunct which underlies it is net; notice algo-that certain 
rules may operate either by moving or by copying a constituent, and, 
although- the two kinds of processes yield exact paraphrases, only the 
former is subject to Ross» constraints; notice, finally, that certain 
feature-changing processes, such as the one that places an overt 
mark on question-phrases, is free from constraints in Jatjanese, but 
not m English. I believe that the few facts mentioned above show 
quite conclusively that Ross' constraints cannot be explained in terms 
of semantics, for, if semantic representations were subject to such 
constraints, any surface structure derived from ill-formed representa- 
tions should itself be ill-formed, which, as we have seen above, is 
not the case. 

As a semantic explanation is not available, and as a mere formal 
statement cannot be regarded as an explanation (at least at the 
present time), it remains to investigate the possibility that 
sentences in which Ross' constraints have been violated conflict 
with certain properties and/or limitations of the perceptual apparatus ; 
the mam goal of this dissertation- is to investigate this possibility. 
In doing this, I will rely, to some extent, on the theory of the 
interaction of perception and grammar proposed in a number of papers 
by Fodor, Garrett, Bever, Langendoen, and others. However, given 
the highly tentative and often questionable status of many of its 
substantive claims, that theory will be used as sparingly as possible. 
Specifically, I will confine myself to three types of perceptual 
principles in proposing explanations for syntactic phenomena, namely, 
principles which invoke the complexity resulting from erroneous 
closure , interrupted behavior, and perceptual conflict respectively. 
Closure principles assert that complexity arises when a proper 
subpart of a structure is mistakenly apprehended as a well-formed 
subpart of that structure, with the result that the remainder of 
that structure appears ill-formed; interruption principles assert 
that a proper subset of the set of discontinuities create complexities 
m proportion to the values of specific parameters, such as the length , 
structural complexity, overall structural predictability , etc., of 
the intervening material; .conflict principles assert that complexity 
arises when two sets of cues assign contradictory values to a stimulus 
in terms of some parameter. 

In using closure, interruption, and conflict principles, I shall 
attempt to narrow down each principle as much as possible, for merely 



- 3 



saying that erroneous closure, interrupted behavior, or perceptual 
conflict are complex is general to the point of nea^-vacuSv Each 
dItT'co --^"''^'^ constrained lo the exte^ S^oied'by the 

ITL IT^T P^^^^^^ility. and past experimental fiLi^ps 

but no attempt will be^ade to offer a definition of the notion 

JT*^=iPl«";-it seems reasonable to assmie that 
should be hard, if not impossible, to comfby Jn 
the absence of an adequate and comprehensive theoiy of linJ^^isUc 
perception, an extremely remote goal at present In kn^^ fi 
that a definition of t^e notion "possible "Sce^tuS pr Scijle" " 
should be a major ^tof psycholinguistic reseLc^ StL" than one 
Of its prerequisites. In .this sense, each proposed priJcSle w 
be viewed as a tentative partial definition of „^+r h 
perceptual princ dple". Clearly iTtllllu^rl ^La t ° "o 'th^''^ 
proposals I shall make_is undeniable, and follows directi? fJom the 
sllnlTilTT r ' theory of linguistic perception; Jhe 

lr Tr, °^ . ' ''''^''^f' *hat Obtaining in other ?ields 

iLllT " ieation, for no one has, to the best of ni.y 

wri?oSV"''T'f ^ ^^^•l^^*^ definition of the notions "poLible 
r'Se"re^:!!°so'f2?"'''^ "^ossi^e phonological 

■ The relation between behavioral' complexity and acceptability- 

judgments IS fairly transparent in some of the cases I consideJ for 
example it is rather easy to show that the relative acceptebUitr 
Of tll'lT "^''^ the length and/or structura/comjlexity 

of the intervening material is varied. However. -not all situations 
are equally clear in this respect, as (a) some ParameSrsrespeJi^ly 
m cases involving conflict, often exhibit a very limited ""nmber S 
possible values, and (b) eve., when an arbitraril^ l^ge nu^^r of 
values IS possible with respect to-some parameter, the corSation 
between perceptual complexity and degree of acceptability i. sometimes 
Obscured through generalization, a process vhich^mposes a "es/no 
dichotomy on the scalar acceptability dimension; the underlying^ 

for the -^^^^T '^^^^^y^"^- *han initial , to allow 

J^rn^^tnor L*^^* '""^ instances of grmrnnaHi^n of 

perceptual restrictions do not necessa^-ily occur at some stage in ' 

fo^ J . .r^^"^^"" °^ t l^e^^Be. but more or less inevitably 
IZrZ r ^"^^""^ (presumably due to an intolerably hiph 

a Sntence'S^'S''^^- .Therefore. I am not making the claim^haf 
a sentence like *hei said tha t John^ had l«.ft was necessarily 
grammatical at some point in the history of English-. 



correlation can m fact be further obscured by reinterpretation, 
generalization, and other extremely common phenomena in language 
change Such difficulties should not, however, preclude Sirtes 

llTs 'J^t^'v '''''^ ungrammatical construct ons a "ire 

this status because of some high degree of perceptual comnlexity; 
just as the fact that certain rule-systems are synchronically 
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unproductive has not, in general, precluded research as to the 
possibility of an earlier productive state. In both situations, 
empirical claims are made, and it should be possible to test them, 
in the latter case through an exaraination of historical records,, 
in the former, through psycholinguistic experimentation (of course » 
tests aimed at proving a putative underlying perceptual -complexity/ 
acceptability correlation con obviously not be 'applied directly to 
structures which have undergone grammatization, but rather to (a) 
the same structures, in dialects (or languages ) whore grarpmatization 
has not taken place, (b) other' strSftures , in the same dialect, 
provided that the relevance *of these structures to the primary y 
ones can be defended, or (c) representations in other perceptual 
modalities, subject to the condition expressed with respect to (b)). 

It would be extremely interesting' to try to find out whether 
there is some quoatifiable threshold of perceptual complexity which 
when reached, allows reasonably certain predictions that graihmatizatior 
^ill occur. To ask this question meaningfully, it would be necessary, 
at the very least, to know (a) what all the pei'ceptual variables 
which may increase or reduce complexity are, and (b) what weights 
should be assigned to the various variables in situations where a 
subset (not necessarily proper) of them interact* As the answer to— 
the former, let alone the latter, question is not known at the 
present time, it is clear that any attempt to compute such a 
greumnatization threshold would be hopelessly premature at this stage. 
For example, one may attempt the generalization that "total", as 
distinct from "partial", perceptual conflict is necessarily 
grammatized (for a discussion of the terms in inverted commas, see 
section ^+.3); however, this "generalization" will be seen to be 
incorrect in general, precisely because there exist mit,igatinf . in 
addition to aggravating ; factors in perception. ~ 

This dissertation is incomplete in two important ways: (a) its 
theoretical claims are seriously tmderdetermined by the data, as I 
have not been able 'to gather enough pertinent facts from a sufficiently 
rich sample of languages, and (b) while it is consistent with the 
results of past experiments, it is not supported by any especially 
designed experiments, owing to the fact that I have had neither the 
time nor the facilities for devising and carrying out pertinent tests. 
Fortunately, both shortcomings can be remedied through future work. 
I believe that despite these two objections whose seriousness should 
not "be underestimated, this dissertation does make a contribution to 
the study of syntax by providing an account which is empirically and 
descriptively superior, as well €is more elegant, plausible, and 
intuitively satisfactory, than previous treatments of the same 
problems. 

The breakdown of this dissertation is as follows: 
Chapter One examines the most important earlier proposals made 
within the Transformational Position and shows them to be inadequate 
on both empirical and explanatory grounds. The Derivational Position, 
which has never been" spelled out in detail, is pursued to some extent, 
and it is argued that empirical adequacy could be achieved at the cost 
of making unggstric^gd use of global constraints, as well as allowing 



reference to .dg^gggg, of accepv ' -rility and to pjes^^^ig, information; 
it is fxirther argued that explanatory adequacy would be highly 
improbable under those circumstances^ since it is hard to see what " 
could not be described with such a powerful apparatus. 

Chapter Two consists of three parts: section 2.0 briefly 
outlines the .theory of perceptual strategies sketched in Sever 
(1970); srjction 2.1 provides-. illustrations of the applicability of 
the clos^zre, conflict, and interruption, principles, and focuses 
essentially on cases in which the complexity/acceptability correla- 
tion i!^ relatively transparent; section 2.2 examines the validity of 
a putative perceptual strategy (whose correctness has usually been 
riSsuaed) to the effect that syntactic choices are made on the basis 
of ^'minimal distance" considerations (.the plausibility of this 
assumption being probably due to the independently attested "recency 
effect" in recall e^qperiments ) ; it is argued that there is little 
or no evidence that minimal distance principles play a significant 
part in syntax; such principles are often in conflict with the facts, 
and> when not in conflict, more convincing alternative explanations 
are available. 

Chapter Three presents a detailed defense of my proposed 
reanalysis of three constraints (the Complex NP Constraint, the 
Coordinate Structure Constraint, and a third constraint involving 
adverbials), which, I argue fall under the ^same generalization. 

Chapter Four employs the three perceptual principles discussed 
in Chapter Twp toipropose explanations for the main island constraints 
in the light of the reanalysis of Chapter Three ♦ 

Chapter Five summarizes the results of the dissertation. 



CHAPTER I 



THE TRANSFORMATIONAL AlW DERIVATIONAL POSITIONS 



1.0. The main subdivisions of this chapter^ nawely^ 1.1^ 1.2^ 
1.3» and 1.1*, are respective3y concerned with Chomsky's initial 
proposal knovn as the A-over-A Principle, with the extensive study 
made in Ross (."IS6T), with Chomsky's reanalysis of much the same facts 
within the framework of his Extended Standard Tlieory, and with Soss' 
attempt to collapse most of his constraints into an Island 
Constraint; all these proposals espouse the Transformational Position. 
The Derivational Position is discussed in 1.2.6 in relation to Ross* 
original position. ^ 

1.1.1. In. Chomsky (l961*a), it was pointed out that the o^ucstion 
•and relative clause transformations are ambiguous when applying to 
a noun phrase modified by a full or reduced relative clause. Thus, 
in (l.l), both the phrase the boy from Los Angeles end its head the 
\)oy are noun phrases; however, the question or relative clause trans- 
formations must apply only to the former, yielding the sentences in 
(1.2), rather than to the latter, as this would yield the ungrammatical 
sentences in (1.3). 

(1.1) You believe the boy from Los Angeles to be 

unbalanced. 

(1.2) a. \-^o do you believe to be unbalanced? 

b. The boy from Los Angeles who you believe to 
be unbalanced has won the Nobel Prize. 

(1.3) a. *Who do you believe from Los Angeles to be 

' unbalanced? 
b. ''The boy who you believe from Los Angeles to 
be unbalanced has won the Nobel Prize. 

Chomsky assumes that transformations roust be unambiguous and propot^es 
a hypothetical linguistic universal that will eliminate the ambiguity. 
This universal (which, following Ross, I shall call the A ^over^-A 
principle ) was stated formally as follows: 

(l.l*) •..if the phrase X of category A is embedded within 
a larger phrase ZXW \diich is also of category A, 
then no nxle applying to the category A applies 
to X (but only to ZXW). 

_ 1.1. 2. In Chomsky (I96l4b), a revised version of Chomsky (l961*a), 
it is no:lnted out in note 10 that the A-over-A principle is too 



strong, as it would predict that ser^tencefi like (1.5) are ungranroatical. 

(1.5) Who do you approve of my seeing? 

(l»6) You ^^prove of my seeing someone. 

Indeed, on the assumption that (I.6) is the so\urce of (1.5)? it contains 
the HP someone embedded within the larger NP my seeing someone ^ and , 
still the question transformation can epply to the lower NP without 
xmgrammatlcality« 

I sho\i3d like to point out at this stage that the failure of 
the A-^ver-A principle to allow the generation of (1.5) depends 
crucially on the overall structure of the grammar which incorporates 
the principle. Specifically, it depends on ray seeing someone being 
dominated by the node NP. If the A-over-A principle is incorporated 
into a graiaaar which generates, all 'complement sentences tinder the 
domination of the node NP (as proposed in Stockwell et al., I968, for 
example), then it becomes a trivial matter to show that the A-over-A 
principle is incorrect, for It would predict that no NP moves out of 
an embedded clmise, and this 1'^ clearly not the case (at least for 
English). 

1.2.1. In Ross (1967), the mosb thorough treatment of the 
constraints at issue within the Transformational Position, Ross 
mentions Chomsky* s observation with respect to sentences like (1.5), 
and adds that the A-over-A principle is also too strong with respect 
to certain cases which exhibit an unbounded sequence of NPs, such 
that for any two NPs X and Y, X either dominates Y or is dominated by i 
it. Specifically, given a string like (1.7) » the A-over-A principle , 
predicts that (l..3a) and (l.9a) alone are grammatical, and incorrectiiy 
rules out (l.8b, c) and (l.9b, c). 

You saw a picture of the master of the house* 

a. What did you see? 

b. Who did you see a picture of? 

c. What did you sea a picture of the master of? 

a. It's a picture of the master of the house 
that you saw. 

b. It's the master of the house that you saw 
a picture' of. 

c. It's the house that you saw a picture of the 
master of. 

The failure of the A-over-A principle in cases like (1.7), where 
nodes of some type A can be recursively embedded to the right of 
other nodes of Type A, is in fact more general, since the variable 
A can stand for S or VP, not Just NP. Thus^ Topicalization and Pseudo- 
Clefting can apply to any of the S nodes in the underscored portion 
of (1.10), yielding (1.11) and (1.12) respectively, while Pseudo- 
Clefting can apply to any of the VP nodes in the underscored portion 
of (1.13), .yielding (l.ll*)- 



(1.7) 
(1.8) 

(1.9) 
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jlieve that John claimed that Mary insinuated 
tat Bill v/as foiilty of murder . 

Joiin claimed that Mary insinuated that Bill 

was guilty of murder, I believe. 
Mary insinuated that Bill was guilty of 

murder, I believe that John claimed. 
Bill was guilty of murder, I believe that 
John claimed that Mary insinuated. 

What I believe is that John claimed that Mary 

insinuated that Bill was guilty of murdeii^ 
What r believe that John claimed is that 
Mary insinuated that Bill was guilty of 
murder* 

What I believe that John claimed that Mary 
insinuated is that Bill was guilty of 
murder. 

(1.13) John decided to try to begin .to write a book. 

(1.14) a. What John decided was to try to begin to 

write a book. 

b. What John decided to try was to begin to 

write a book. 

c. Vrnat John decided to try to begin was to 

write a book. 

1.2.2. After having considered the cases which were incorrectly 
ruled out by the A-over-A principle, Ross turns, in section 2.2, to 
the six-*- cases in (1.15) which, according to him, can be handled by 



Ross also mentions a seventh case, suggested by McCawley and 
involving the Adjective Shift Rule, but concludes in section 2.3 
that the nxle itself is inadequate and that a more satisfactory 
formxilation would no longer require the A-over-A principle. 



(1.10) I be 

th 

(1.11) a. 
b. 



(1.12) a. 

b. 

c. 



that principle in a satisfactory manner: 

(1.15) a. Elements of relative clauses cannot be 

questioned or relativized; thus (l.l6) is 
ungrcunmatical • 

(1.16) *This is the girl who I know a boy who likes. 

(1.15) b. Elements of sentences in apposition to 

"sentential'^ nouns like fact , idea , question , 
etc., may not be questioned or relativized, 
as may be seen in (1.17): 



t 
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(1.17) *Wao did I mention to you the fact that John 

seduced? 

(1.15) C-. A clause modifying a head. UP cannot be extra- 
posed beyond "the first sentence ud", as 
shown in (l.l8). 

(1.18) »A proof that the claim had been made was Riven 

that John had lied. 

(1.15) d. The head of a relative clause cannot be 

questioned or relativized, and neither can 
a noun like fact or idea when it is modi^-ied 
by a clause (Ross omits to mention the 
latter case); these two restrictions are 
exhibited in ( 1.19a, b) respectively: 

(1.19) a. »V,'>io did he expect who I was acquainted with 

would show up? 
b. *What do I believe (that) that John had lied 
is well established? 

(1.15) e. A HP which is exhaustively dominated by a 
^ Determiner cannot be moved out of the NP 

which immediately dominates that Determiner 
as in (1.20): ' 

(1.20) *Whose did you buy house? 

(1.15) f. A conjunct or a disjunct IfPin a coordinate 
node cannot be moved out of the latter as 
seen in (l.2l). In fact, the A-over-A 
principle prevents the movement of a member 
of anjr coordinate node, and this restriction 
IS necessary, for Ss and VPs can also move, 
as seen in (l.ll), (1.12), and (l.Hi), and 
the ungrammatical (1.22) and. (1.23) must 
be prevented. 

(1.21) a. n.'hat will you put between the bed and? 
b. *V/hat will you put between and the wall? 

(1.22) a. *What John claimed that Mary left and was 

that Bill rem away, 
b. *vnjat John claimed and that Bill ran away 
was that Mary left. 

(1.23) a. »What John likes to eat ana is to drink, 
b. *Vhat John likes and to drink is to eat.* 

R«o. "r^!?^ ''^"'^ A-over-A principle was too strong, 

Ross proposes to handle the cases which the principle was adequate ' 



for by four separate constraints. These four constraints taken 
together are weaker than the A-over-A principle, as they account for 
th^ (I.15a)-(l.l5f ) cases without starring the sentences in (l.5), 
(1.8), (1.9), (1.11), (1.12) and (l.lU), but they are also stronger, 
as they can account for additional cases. This last fact does not 
constitute a defect of the A-oyer-A principle, for the latter was 
never meant to account for all the constraints in a grammar; it 
does, however, strengthen Ross? analysis . The main wg^ in which Ross' 
cons^traints stren^hen the A-over-A principle concerns the fact that 
elements of conjuncts or disjuncts cannot move out of the coordinate 
node; this constraint is not expressed by the A-over-A principle in 
those cases vhere the coordinate node and the elements prevented 
from moving belong to different categories, as in (l.2i*a), where the- • 
un_derscored element cannot move to yield (l»2Ub): 

(l.i?U) a. John loves Mary and Bob hates Jill. 

b. *It's Mary who John loves and Bob hates Jill. 

Sentences like (l.25b) and (l*26b), whose ungrammaticality is 
due to the migration- of- the nodes VP and ADJ respectively from a 
complex NP, could also be claimed to strengthen Ross' reformulation 
of the A-over-A principle. 

(1.25) a. I believe the claim that John decided to 

try to write a book , 
b. \7hat I believe (»the claim) that John 
decided to try was to write a book. 

(1.26) a. John heard the claim that Maiy was pretty, 
b. Pretty though John heard (*the claim) that 

Mary wus, he never showed much interest 
in her. 

However, these sentences are not relevant in the model of grammar 
Ross proposes, for he regards VP^s and predicative ADJ's as dominated ' 
by the category NP. 

1.2.^1. The four constraints Ross puts forward in an attempt to 
overcome the inadequacies of the A-over-A principle are: The Complex 
Noxrn Phrctse Constraint , The Coordinate Stnxcture Constraint , The Left 
Branch Condition on the Pied-Piping Convention , arid The Sentential 
Subject Constraint (henceforth, the CNPC , the CSC , the LBC on the PPC 
and the SSC respectively) . The first two are hypothesized to be 
universals ,2 the last two are claimed to be language-specific. 



The iiniversality hypothesis was subsequently shown to be 
false by a number of investigators. Thus it appears that the 
Complex NP Constraint does not hold in Swedish (D. Perlmutter, 
personal communication) or in Portuguese (C. Quicoli, personal 
commimication) . This demonstrates the incorrectness of various 
attempts to' explain Ross* constraints on semantic groiinds, since, if 
the structures at issue were ill-formed semantically , there shoxzld 
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be no exceptions in any language whatever. In contrast, the 
explanation which I shall propose, and which ultimately involves 
the violation of conversational and/or perceptual principles is 
not defeated by a few counterexamples, since principles of this 
kind can be violated. x- ±' 



(1.27) The CHPC: No element contained in a sentence 

dominated by a noun phriise with a lexical 
head noun may be moved out of that noun 
phrase by a transformation. 

Ross treats the feature CtLexicalD as a formal one, but the example'; 
he gives from English and Japanese suggest that the plus^^value is 
associated vith semantically empty forms, like the .English it that 
results from Extraposition, or the Japanese koto, mono, whi^ 

translate roughly as "thing". 

The CNPC takes care of cases • (l. 15a) and (I.15b) that were 
accounted for by the A-over-A principle, and can also block the 
movement of any kind of element out of the modifying clause, while 
the A-over-A principle can only block NPs. 

(1.28) The CSC: In a coordinate structure, no conjunct 

may be moved, nor may any element contained 

in that conjunct be moved out of that conjunct. 

It is clear that the first part of this constraint has the same 
effect as case (l.l5f) of the A-over-A principle, while the second 
part covers the migration of elements of coordinate terms , not 
covered by the latter. A number of count,erexamt)les to (1.28) 
Unvolvmg asymmetric coordination and across-the-board operations) 
which Ross brings up will be discussed in detail in Chapter Three. 

(1.29) The PP£: Any transformation which is stated 

in such a way as to effect the reordering of 
some specific node NP, where this node is 
preceded and followed by variables in the 
structural index of the rule, may apply to 
this NP or to any non-coordinate NP which 
dominates it, as long as there are no 
occurrences of any coordinate node, nor of 
the node S, on the branch connecting the 
higher node and the specified node. 

(1.29) has the effect of allowing the sentences (l.8b, c) and 
tl.9b, c) which the A-over-A principle ruled out. As I have already 
pointed out (1.29) needs to be strengthened in order to account for - 
the b and c cases in (l.ll), (1.12) and (l.Hi). Of the five language- 
specific conditions that Ross imposes on the PPC, I shall mention 
only the LBC as the other four strike me as truly idiosyncratic 
grammatical facts and thus of little interest for the Behavioral 
Position I have espoused. 
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(1.30) The LBC: No NP which is the leftmost constituent 

of a larger ITP can be reordered out of this 
NP by a transformational rule* 

The LBC accounts for cases (l.l5d) and (l.l5e) of the A-over-A 
principle. 

The fourth constraint Ross proposes, the SSC, accounts for the 

unacceptability of ( 1.31b) in contrast to the acceptability of (l.32b) 
and (1.33b). 

(1.31) a. That John loves Mary is odd. 

b. *It*s Mary who that John loves is odd. 

(1.32) a. It's odd that John loves Kaxy. 

b: It's Mfiury who it's odd that John loves. 

(1.33) a. Bill claims that John loves Mary. 

b. It's Mary who Bill claims that John loves. 

(1.3^) The SSC : No element dominated by an S may be 

moved out of that S if that node S is dominated 
by an NP which itself is immediately dominated 
by S. 

It is not clear whether the A-over-A principle was initially intended 
to account for sentences like (l.31b), for it is not clear whether 
Chomsky considered complement sentences to be dominated by NP. I 
would imagine that he did not, for counterexamples like (1.32b) and 
(l.33b) are much too obvious. In any event,, in a grammar which does 
represent all complement sentences as dominated by NP, the A-over-A 
principle is much too strong, and the SSC is a welcome weakening. 

The four constraints mentioned so far do indeed improve on the 
A-over-A principle. However, there is another class of configurations 
which impose constraints on movement transformations, namely, the 
adverbials, and neither the A-over-A Principle nor the four above- 
mentioned constraints handle these cases adequately. Thus, it seems 
that at least the optional (in a sense to be made more precise in 
•Chapter Three) adverbials cannot lose elements. In talking of 
adverbials, I shall refer ambiguously to the entire adverbial, or 
to the adverbial minus the "connector" which introduces it (pre- 
position, 'subordinating conjunction', or whatever; the term 
'connector' is used for purely mnemonic purposes, and no claim is 
made that a category Connector should be part of the non-terminal 
vocabulary of the grammar ) ; the reason is that the 'smetLler' 
adverbial as a whole can sometimes be moved (even though it is an 
element of the 'larger' adverbial)., but elements of the smaller 
adverbial cannot be moved. 

In Ross (1967), the adverbs are treated as a subset of the 
complex NP category. That is, it is assumed that all adverbs are 
complex NP's at some level of representation, and that movements 
out of adverbial clauses occur prior to some transformation which ■ 
substitutes subordinating conjunctions for the heads of complex NP's 
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(and, perhaps., some more prelexical material) • The hypothesis that 
adverbs originate as complex NP's is due to Geis, who supports 
It extensively, and, I think, convincingly, in Geis (1970) • Even 
though Geis does not carry out a detailed analysis of all the forms 
which traditional grammarians have called "subordinating conjunctions" 
he strongly implies that all adverbs should be analyzed as complex 
NP's. 

Given the strong evidence supplied by Geis for the cases he 
analyzed, there is no reason to reject outright the hypothesis that 
all subordinating conjunctions may originate as complex NP's. 
However, even if such an analysis ultimately turns out to be correct, 
there are several cases involving adverbials in which movements are 
blocked and which it cannot account for. 

One such case concerns the parentheticals , which, being 
(sentential) modifiers are (at least surface) adverbials. As (1.35) 
shows, such parentheticals become islands only upon becoming sentence 
modifiers. The difficulty .here is that it is hard to see how they 
could be analyzed as complex NP's. 

(1.35) a* I told John that Bill was sick. 

b. Bill was sick, I told John. 

c. It's me who supposed that John is sick; 

d. *It*s me who John is sick, supposes. 

This claim is independently supported by the unambiguity of 
(l.'36c), which cannot be read as a paraphrase of (1.36b). Neverthe- 
less, even though the parenthetical seems to be a deep, rather than 
a surface, structure modifier here (see Nobel, 1971), there is still 
no natural way of analyzing it as a complex NP. 

(1.36) a. I suppose John is sick. 

b. John is sick, I suppose. 

c. It's me who supposes that John is sick. 

d. *It*s me who John is sick, supposes. 

A second case which raises problems for the CNPC-reductionist 
hypothesis concerns the comparative and equitative constructions. 
The source of such constructions is one of the least well understood 
problems in transformational grammar, 3 but I shall assume an analysis 



-^Ross (1969a) proposed (ib) as the source of (ia), an an 
attempt to explain the occurrence of an^ or ever inside comparative 
clauses, as in (iia), as well as the ill-formedness of comparative 
clauses containing overt negation, as shown by (iib). 

(i) a. John is taller than Bill. 

b. John is tall to an extent to which Bill is not 
tall. 

(ii) a. John is richer than any of my friends has ever 
been. 

b. *John is richer than Bill isn't. 



I 



^ In his dissertation, Ross no longer defended (ib) as the source 
of via), ^as he had noticed that equitatives exhibit the same pro- 
perties as con5>aratives, as can be seen by comparing (ii) and (iii). 

(iii) a. John is as rich as any of my friends has ever 

been . 

b. *John is as rich as Bill isn't. 

As there is no obvious way in which equitatives can be analyzed as 
containing negation, Ross presumably concluded that the facts of 
(iia) and (iib) are not determined by negation and therefore do not 
force an analysis of (ia) along the lines of (ib). 

There is also a second difficulty with Ross* analysis in (ib); 
(ia) and (ib) are not paraphrases. Thus, if John is taller than 
Bill is ne cessarily not taller than John, while if John is 
tall to an^ extent to which Bill is not. Bill necessarily is tall to 
an extent to which John is not; in other words the relation between 
John and Bill is asymmetric in (ia) and symmetric in (ib). 

I believe, however, that Ross* initial analys^is of comparatives 
made an- important point, namely, that comparatives should be 
analyzed as containing negation at some level of representation, 
fiLlthough not necessarily as in (ib). 

One fact ^Al^ch suggests that comparatives are inherently 
negative is the manner in which they interact with Coordination- 
Reduction. 

Consider first (iv) and (v). 

(iv) a. John thinks that Mary is pretty and John 

thinks that Jill is ugly. 

b. I John thinks that Mary is pretty and that Jill 

is ugly. 

c. John thinks that Mary is pretty or that Jill is 

ugly. 

(v) a. John doesn't think that Mary is pretty and John 
doesn't think that Jill is ugly. 

b. John doesn't think that Mary is pretty and that 

Jill is ugly. 

c. John doesn't think that Mary is pretty or that 

Jill is ugly. 

The reduced paraphrase of (iva) is (ivb), not (ivc), while the 
reduced paraphrase of ^(va) is (vc), rather than (vb). The principle 
which determines the change from and to or in (v) is known as 
DeMorgan's laws. I shall now show that DeMorgan's laws operate 
in comparatives, but not in equitatives, which strongly suggests 
that the former, but not the latter, are inherently negative. Thus, 
consider (vi) and (vii). 

(vi) a. John is as clever as Mary is astute and John 

is as clever as Jill is sly. 

b. John is as clever as Mary is astute and as Jill 
is sly. 
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(vi) 



(vii) 



John is as clever as Mary is astute or as Jill 
is sly. 



a. 



John is cleverer than Mary is astute and John 
is cleverer than Jill is sly. 

b. *John is cleverer than Mary is' astute and than 

Jill is sly. 

c. John is cleverer than Mary is astute or than Jill 

is sly. 

In (vi), as in (iv), the reduced form of the a^sentence is the b-one, 
while m (vii), as in (v) the reduced form of the a-sentence is the 
c-one. It IS difficult to see what, outside of DeMorgan's laws, 
could deteimne the difference between the paradigm in (vi) and the 
one in (vii). 

Another piece of evidence, weaker than the first one, but 
which nevertheless suggests that comparatives are inherently negative, 
IS provided by the surface structure of French comp'aratives and 
equitatives. In French, if the verb of the comparative or equitative 
clause has not been deleted, there must be a negative morpheme in 
the former case, but there cannot be one in the latter, as shown in 
iviiib; and (ixb), the French counterparts of (viiia) and (xia) 
respectively. 



(viii) 



(ix) 



a. 
b. 

a. 
b. 



John is tcdler than George 
Jean est plus grand que George 



I'est, 



John is as tall as George is. 
Jean est aussi grand que Georges 



The acceptable version of (viiib) is a oaraphrase of (viiia), not of 
th& semantically illrformed (see below). * John is taller than George 
isn_t. The particle ne has no semantic import, which is shown by 
the paraphrase relation between (viiia) and (viiib), as well as by • 
the fact that semantic ne always has a reduplicated clause-mate , 
usually £as, while no such reduplication is possible in (viiib), 
as shown by the ungrammaticality of * Jean est plus grand aue Georf;es 
po I'est pa£. Rather, ne is probably itself a reduplication of some 
semantically negative morpheme in the string, which seems quite 
plausible in view of the fact that French has a rule reduplicating 
negation in any case (although the reduplicative rule in (viiib) is 
probably not the same as the rule which yields Jean n'est pas ici . 
since the reduplicated form is necessarily a clause-mate of the 
original in the latter case, but not in the former). If ne in 
(viiib) is a reduplicated form, it is hard to see what the semanti- 
cally negative element in the string is, outside^pf plus 'more'. 

The import of the above discussion is that negation should not 
appear inside the comparative clause, as in (ib), but rather in the 
semantic representation of more . I have not found, however, a 
satisfactory way of doing this. One possibility would be to 
represent more (than) as NOT EQUAL (TO) . However, this representation 
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vould be incomplete, for not equal to is a symmetric relation, 
while more than is asymmetric; in fact, not equal to is equivalent 
to more than or less than , so that a semantic feature like C-EQUALD 
could appear in the semantic representation of both more and less, 
provided that there is another feature which distinguishes bet^^n 
the two. relations, by having the value + in one and - in the other; 
the problem is that it is not obvious what that feature should be. 
It should be noted, incidentally, that less, has the same inherently 
negative semantic properties as more ; this can be vcri^-ied by 
substituting less, for more and moins for 2lus in the above examples. 

Various generative - semanti cists have T>roT)osed that (ia) be 
repifesented as (x). 

(x) John is tall to an extent which exceeds the extent 
to which Bill is tall. 

This analysis, has certain difficulties which I shall touch upon 
further down in the text, but one rather interesting difficulty 
which, as far as I know, has not been noticed so far, is that although 
exceed is synonymous with be more than , the latter only is inherently 
negative. Thus, e xceed exhibits the paradigm in (xi), which, 
surprisingly enough, is similar to the paradigm in (vi) rather than 
to the one in (vii), since the b-, rather than the c-sentence, is 
a paraphrase of the a-one. 

(xi) a. John exceeds Mary in intelligence and John 
exceeds Bill in temerity. 

b. John exceeds Mary in intelligence and Bill 

in temerity. 

c. John exceeds Mary in intelligence or Bill 

in temerity. 



In conclusion, I )iave been unable to discover an adequate 
semantic representation for more than and less than , beyond the 
observation that they must somehow involve negation. 

It is also worthwhile to evaluate the evidence in (ii), which 
led Ross to positing (ib) as the source of (ia). (iia), is, in all 
probability, irrelevant to negation, in view of the acceptability 
of (iiia); the conclusion seems rather unavoidable that any and 
ever occur in more than just negative environments. Concerning (iib), 
Ross implied that it was out for the same reason as * John didn't 
see nothing is out in standard English; in other words, he attributed 
the badness of (iib) to a surface property of English. Howev-r, 
I believe that (iib), as veil as (iiib), is out on semantic j^rounds. 
Thus, only scalar properties can be compared, and (at least certain) 
negated adjectives do not stand for scalar properties; this can be 
seen by considering the ill-fomedness of » to a very high degree. 
John isn't clever. *John is exceedinfsly not cleve r, etc. In short, 
I am claiming that (iib) and (iiib) are bad for the same reason for 
^hich »John is cl everer than Bill is an assistant professor is bad. 
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consonant with Geis' reductionist hypothesis. Thus, I shall assume 
that the source of (l.37a) is roughly (l.37b), and that the source 
of (1.37c) is roughly (l.37d); these assumptions make it possible to 
exclude (l.38a) and (l.38b) by the CNPC. 

(1.37) a. John loves Mary more than he hates Bill. 

b. John gloves Mary to a degree which exceeds 

the degree to which he hates Bill. 

c. John loves Mary as much as he hates Bill. 

d. John^loves Mary to a degree which 

{ is the same asl 

L equals J degree to which 
he hates Bill* 

(1.38) a. *It's Bill who John loves Mary more than he 

hates, 

t). *It's Bill who John loves Mary as much as he 
hates* 



In Chapter Six of his thesis, Ross points out that' feature-changing 
rules are also: subject to the CNPC. However, although the rule 
which permits the occurrence of any or ever can go into a 
comparative clause, it cannot go into what the reductionist hypothesis 
suggests as its source, as shown in (1.39). 

I 

(1.39) a. John is richer than any of my friends has 
ever been. 

b. *John is rich to an extent which exceeds the 

extent to which any of my friends has 
ever been rich. 

In order to explain both (1.38) and (1.39), it becomes necessary to 
assume that the transformation which replaces an extent_which 
exceeds t he extent to which by more than Ergce^s. feature-changing 
rules, but follows, chopping rules. Unfortimately, there is evidence 
of another sort which requires that the transformation in question 
32£gcg^e chopping rules. Indeed, elements of comparative clauses 
can be chopped following Comparative relation, as shown by (l.l*0). 

(l.l*0) a. John is taller than a man. 

- b. I know a, man who John is taller than. 

c. *John is* tall to an extent which exceeds 

the extent to which a man. 

If the lexical transformation follows the chopping rules, (l.l*Oc) 
would become the input to Relativization, and the latter would be 
blocked by the CNPC, since a man is inside a relative clause; there- 
fore, we would wrongly predict that (l.l*Ob) is unacceptable. Thus 
(1.38) requires that the lexical transformation follow Relativization, 
while (1.^0) requires the opposite ordering; the resulting paradox 
suggests that the lexical transformation we have posited does not 
exist, and therefore that (l.37b) and (l.37d) are not the sources 
of (l.37a) and (l.37c) respectively. 
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A third case which creates rather serious problems for the 
reductionist hypothesis is illustrated by the contrast in 
graiamaticality between the members of the pair in (1.1*1). 

(l^Ul) a. (?)The accident ^Axich a reporter has just 
disclosed the place of took five lives, 
b. *The accident which the police found Maiy's 
body at the place of took five lives. 

In both (l.Ula) and (l.Ulb) an element of the construction the 
Place of the accident has been relativized. Regardless of whether 
that construction can be analyzed as a complex NP ov not, there 
is no reason to believe that the transformation(s) by virtue of 
which it assumes its surface form is ordered before Relativization 
in (l.Ula) but'after Relativization in (l.Ulb). Thus, unless we 
refer to the fact that the place of the accident is an adverbial 
in (l.Ulb) but not in (l.41a), there seems to be no way of accounting 
for the difference in acceptability between these two sentences. 

A similar difficulty is created by reduced comparative clauses. 
As (l.l*Ob) sho'fs, the adverbial (without its connector) can be 
moved, but elements of that adverbial cannot, as shown by the 
contrast in acceptability between (l.l*2b) and (l.l*2c). 

(1.1*2) a. John is taller than a member of the 
committee. 

b. This is the member of the committee who^ 

John is taller than. 

c. *This is the committee which John is taller 

than a member of. 

Paradipns similar to (1.1*2) can be constructed using other reduced 
adverbial clauses, as shown in (1.1*3) and (1.1*1*). 



(1-1*3) 



a 



John left the city because of a brother of 
the queen. 

b. ?This is the brother of the queen who 
John left the city because of. 

c. *This is the ,queen who Joh^ left the city 
because of a brother of. 

a. John left the city in spite of a brother of 

the queen. 

b. ?This is the brother of the queen who John 

left the city in spite of. 

c. *This is the queen who John, left the city 

in spite of a brother of* 

To summarize: the three types of islands^ we have examined above. 



(lAl*) 



This term was coined by Ross (1967 ♦ Ch. 6); an island is^a 
configuration such that certain rules cannot involve an element 



1 
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extemed, and an clement internal, to it. 



namely parentheticals , comparative and equitative clauses, and non- 
clausal adverbials, have been shown not to be reducible to the CNPC- 
case, and thus suggest rather strongly that, given R6ss» franework 
observational adequacy requires the imposition of a fifth constraint 
over and above the CNPC, the CSC, the LBC, and the SSC, which we 
may call the Optional Adverbial Constraint (or the OAC) , and whose 
statement would have to be something like (1.1*5). 

(1.1*5) The OAC: No element of an optional adverbial^ 
(i.e., a VP- or S-modifier) can be moved from 
xinder.the dominance of the S-node which 
immediately dominates the node ADV (or the 
corresponding node IIP, in a grammar which does 
not employ the symbol ADV). 



"Adverbial" is here used in the narrower sense, i.e., not 
including the connector. 



I cf* rather easy to see that the three cases which required 
U.'*5) cannot be disposed of by the A-over-A Principle either. 
Indeed, the migration of HP's from parentheticals could not be 
blocked, for there is no reason to posit a node tiF over the parenthe- 
ticals; the blocking of movements out of comparatives and equitatives 
would, of course, depend on whether such clauses are dominated by 
the category NP or not; as for cases like (1.1*1), the contrast in 
acceptability between the a and the b sentence must go unexplained. 

In the alternative analysis I will develop in Chapters Three 
and Four, I shall argue that the OAC is an unnecessary addition to 
the grammar, and that the three cases which seemed to argue for 3ts 
introduction follow automatically from more general principles 
of analysis. 

1.2.1*. 2. In addition to the four constraints Ross proposes as 
substitutes for the A-over-A Principle, he also outs forward two 
constraints limiting the scope of movement transformations in 
general. To see why that was necessary, ve shall consider how case 
U.15CJ of the A-over-A principle is handled in Ross* framework. 
In section 2.3, Ross proposes to account for the unaccentabUity 
of (1.18) with the CNPC. Indeed, assuming that the stage 'in the 
derivation of (l.l8) which precedes Extraposition-from-IlP is (1.1*6), 
we can see that in order to derive (l.l8), we must move R3— an 
element of S2— out of NP^, and the latter is a Complex NP whose 
modifying clause is 83. 




a proof that the claim that John had lied had been made vas civen 



However^ if HP. were not a Complex NP^ G3 would still be prevented from 
extraposing "too far"» as (l.l*7b) c^mnot be derived from (l,l*7a). 

(1.1*7) a. That the claim ihat John had lied had been 
, made was obvious . 
. . b.^ *Tliat the claim had been made was obvious 
that John had lied. 

It seems rather counterintuitive to claim that Extraposition and 
Extraposition-from-KP are constrained by different principles » and 
nosi> proposes, in Chapter Five, that the notion of bounding be aOded 
to the theory of granaiar* Thus, a nxie "upward bounded if its 
domain is restricted to whatever is dominated by the "first sentence 
up". Ross further proposes a putatively universal constraint, the 
Right Roof Constr aint (henceforth, the RRC), which easily accounts 
for (1.18) and (fn47b) . 

(l.J*8) The RRC : Rules which move elements to the right 
are upward bounded. 

; 

The RRC^severcly limits rightward movements. While leftward 
movements are not, in general, limited in scope, they are neverthe- 
less limited in "direction", for no element can move out of its 
"strip" (or "highest island"). The highest island of an element Y 
is the set of all elements which command Y (the notion command was 
developed by Langacker, and, informally^ asserts that X commands Y 
if X is in a sentence which dominates Y). Thus, it turns out that no 
movement transformation is completely free, and Ross proposes an 
additional restriction which he called elsewhere the Hiphest Island 
Constraint (henceforth, the HIC ) , and which can be formulated as 
follows : 

(1.1*9) The HIC : A transformation can only move an 
element to a position which commands that 
element •s initial position. 



1.2.5. In Chapter Six of. his dissertation, Ross points out 
that chopping niles are not the only ones which are subject to his 
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various constraints and that feat^ire-changing rules and sooe 
deletion rules also have this property. For example, (l.SOa), is 
claimed to be bad because the rule which changes some to any 

II ff ?nM"^**?''? clause from outside the c^Jit^ninrcomolex 

* «nd (1.50b) IS claimed to be bad because a deletion rule hw 
committed the same offense. 

(1.50) a. «I never told you I had seen the nan who 

.ever fought against anybody, 
b. »Jill is prettier than you are aware of 
the fact that Mrry is. 

On the ot'her hand, Ross notices that not all unbounded rules 
are subject to island constraints. For example, copying ^ules 
pronoifiinalization rules, and certain deletion rrkes are n^?, as 
shown by the acceptability of (l.51a).(l.51c) , in which ?he 'respective 
rules reach inside complex HP's in the same way in which feature- 
changing and deletion rules do in (l. 50). 

(1.51) a. Who did I show you a girl who likes him? 

Cdialectally restricted^ 

b. John dislikes all the girla who have 

chased him. 

c. You can certainly use a rifle, but I know 

a girl who tfiinks you can't. 

Ross attempts to make sense of the situation by means of the 
S^i^S^?! f Jhich I shall call the Dichotomous Behavior 
Principle (henceforth: the DBP ): 

(1.52) The DH?:^ Chopping rules, feature-changing rules, 

and unidirectional naes of pronominalization 
obey the CNPC, CSC, LBC and RSC; bidirectional 
rules of pronominalization and copying rules 
do not. 

The following two claims made by the DBP (the first, implicitly, 
the second, explicitly) are worthy of attention: 

(1.53) a. Deletion is a subcase of pronominalization, 

and both phenomena behave alike; 
b. The criterion which determines the applica- 
bility of island phenomena to specific 
rules of pronominalization is whether the 
rules at issue operate unidirectionally 
or bidirectlcnally. 

Thus, the difference in acceptability between (l.5Ua) and (l.5l«b) 
must be accounted for by arguing that the rule which introduces the 
underscored pronoun in (l.5lia) is the rule of pronominalization 
whicn operates in other environments, while the rule which deletes 
that pronoun to yield il.5k\>) is a special deletion rule restricted 
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to this specific environment. 

(1.51*) a. The mink coat is ready for you to find a 
girl willing to put it on. 
b. *-The mink coat is ready for you to find a 
girl willing to put on. 

Notice that the only reason for saying that the deletion rule is 
unidirectional is that the for-to clause cannot prepose; the non- 
existence of a preposing rule does. not lead Ross to the claim that 
the pronominalization rule which yields (l.5^a) is unidirectional. 

Observe now that the rule which yields ( 1.50b) is clearly the 
same rule which yields (l.55a) and (l.55b), that is, VP-Deletion. 

(1.55) a. Although her brother is, I am afraid that 
Mary is not efficient enough, 
b. Jill is conceited, but I don't really know 
a girl who isn't. 

To make this proposal work, Ross woiad be forced to claim that the 
rule of pronominalization which yields (l.5^a) is the same rule 
which pronominalizes in other environments, while the VP-Deletion 
rule which operates in comparative constructions like (l.50b) is 
differen t from the VP-Deletion rule which operates in non-comnarative 
environments, since tiie latter is (i) bidirectional, and (ii) exempt 
from island-constraint??, as shovn .by (l.55a) and (l.55b), respectively 
Clearly, such a position is inconsistent. 

In connection with the DBP, Ross writes,' "it is at present a 
total mystery ... why unidirectional pronominalizations should obey 
the constraints..." However, no mystery is involved, since the 
distinction Ross makes between unidirectional and bidirectional 
pronominalization is almost certainly irrelevant. Thus, if we 
examine (1.5^) more carefully, it becomes apparent that its unaccept- 
ability not only has nothing to do with unidirectionality , but has 
nothing to do with deletion in the first place: The relevance of 
deletion to comparative formation^ was probably assumed because 



^Chomsky (1971. note 29) also assumes the relevance of 
deletion in this case, and proposes that the comparee is in fact 
moved by cyclic hopping until it is adjacent to than, after vhich 
it is deleted. Thus, the island constraints found with comparative 
constructions are blamed on an invisible chopping rule: This 
proposal is untenable for the same reasons as Ross*. 



identity between the 'comparator* and the 'comparee* requires the 
deletion of the latter in English (although not, for example, in 
Japanese). But it is not necessary to resort to Japanese in order 
to expose the irrelevance of deletion, for English has comparative 
constructions like (l*56\ in which no deletion occurs* 
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(1.56) «^o^^is^taller than it has been assu^^^ 

LTLltc!'ara^^^^^^^^ to the C.PC, CSC, 

(1.57) " a. «John is taller than i/'know a boy- who is fat 

that Bill is fat }> °' '-'^ 

»John is taller- than Bill i<5 fo+ 

full-time. . employed 

c. »John is taller than that Bill is fat is 
clear. 

^* il J^'"" ^^11 

The problems raised bv fl S7^ f'ny +k« 

Positic in general vili tien ^n i°^h""^ 1°' Transformational 
to point out at this sta^lhS th^i J section. Suffice 

Deletion in (1 50b) ,-= 3 \. f' ^'""Sly suggest that VP- 
There reL^^'hive^S ~ro?"-°°f"'= "nacceptaMlity. 

(1.58) a. This rock is too heavy for you to find a 
V, »™- to Dick it up. 

b. »This rock is too heavy for you to find a 
ooy willing to pick up. 

I suggest that what distinguishes +h*> +«« «i 

is that in one class of deL^^n . -^^^^ °^ deletion rules 

to an Obligatory deen ^^^^^^^ *° structures subject 

Clause whxch cS^tains' tL" dlletee ^^ SHgnt^i'"' 

deletor. while in the other class ^ases^^ucr''^'":-' °^ 

and consequently no deletion need occ^ in +^ condition exists, 

structures. The pertinent constlaLt which I ^°iTdT'^" . 

propose an explanation for in Chapter 'fS! is a'59)!''"^ 

(1.59) Deletion rules are subject to island 

constraints if and only if there exists 
an independent condition requiring that 
an element coreferential with the deletor 
occur within some configuration which 
contains the deletee. 

const^'el Eelativization is 

level of representation; in contrast, rSles Ske ^""^ 
not coupled with such conditions, aa'c^^: l^l Zt^TiZi^ 
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(l.6o), which' is identical with (l.55b) in-^all respects except for 
the fact that the identical VP's in the latter are not identiceJ. 
in the former, and no deletion consequently tcikes place. 

(1.60) Jill is conceited, but I don't really know 
a girl who isn't in love with herself. 

With respect to {1.3^) and (I.58), it will be shown in Chapter Four 
that the occurrence of a coreferent of the subject of the main verb 
within the for-to clause is an inference 'invited' by a semantic 
condition requiring that a resxiltative relation obtain between the 
main and the for-to clauses; this Invited inference (for a definition 
of this term, see Geis and Zwicky (l97l)) has in general the same 
effects as a strictly linguistic condition, an interesting but not 
isolated fact (l will show below other instances of syntactic form 
being determined bjr invited inferences or by 'conversational 
implicatures' in the sense of Grice (1968)). 

It should be noted that (1.59) refers to strict deletion, not 
to pronominalization in general, end therefore predicts that only 
the b-, but not the a-seritences in (I.5I*) and (-1.58) are bad. 

1.2.6. It is important- to notice at this point that if (1.59) 
is correct, the Transformational Position is observationally 
inadequate.' The reason is that the conditions which dictate or 



I cm assuming a transformational grammar which does not allow 
arbitreury^ coding symbols. If such symbols are tolerated, global 
rules can of course be avoided. 



invite the inference of the occurrence of two coi'eferential NP's 
are semantic, and consequently belong in deep stvucture. On the 
other hand, the pertinent deletion processes are probably very late 
ones, and belong to a stage very close to surface structxxre. There- 
fore i an adequate statement of (1*59) within the theoretical frame- 
work which Ross espouses (Generative Semantics) reqtiires the employ- 
ment of a global rule. 

The claim that island constraints should be reformulated as 
global rules has in fact been made by Ross himself in Ross (1969), 
where it was pointed out that there is a deletion rule. Sluicing, 
which, when deleting the remainder of an isl€tnd out of which an 
element had been moved by Question-^Iovement , yielded more acceptable 
sentences than those which would have resulted had Sluicing not 
applied. The point is demonstrated in (I.61) for the CNPC, CSC, 
SSC and OAC. 

(1.61) a. ?I know you kissed a girl who had received 

a coat from someone, but I don't know who 
(*you kissed a girl who had received a 
coat from) . 

b. ?Bill and someone were dancing, but I don't 
know who (*Bill and vere dancing). 
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(1.61) c. ?That John loves someone is well-known, but 

I don*t know who (*that John loves is 
well-known ) . 
d. ?John left Mary because she had slept with 

someone, but I don*t know who (*John left 
Mary because she had slept with). 

Because of facts like" those in (l.6l), Ross proBOsed the followin^^ 
ainmended formulation of island constraints: 

(1.62) When an element is moved out of an island, 

unacceptability results; if the remainder of 
the island is later deleted, unacceptability 
of lesser severity results. 

Clearly, both the reduction in acceptability due to island-deletion 
and unacceptability due to obligatory coreferentiality coupled with 
deletion, must be global conditions on island- constraints . 

With regard to (1.59), one would want to state it more generally 
than just f*or deletion rules, since there is a definite parallelism ^ 
between the fact that non-null pronominalization does not create 
unacceptability even in the presence of an obligatory coreferentiality 
condition, while strict deletion does, and the fact that coDying rules 
do not create unacceptability, while chopping ones do. In both cases, 
no unacceptability arises when the affected element is reduced to 
a non-null pro-form, and unacceptability does arise when the affected 
element disappears completely. The similarity between the two sets 
of phenomena can be captured more perspicuously if we regard chop- 
ping rules as proceeding in two stages, i.e., copying followed by 
^ deletion (as proposed, for example, in Drachman (1970)). Such a view 
is particularly reasonable for rules like English Relativization or 
Question-Movement, which do not appear as copying ones in the 
standard dialect, but which nevertheless are realized as copying 
rules in many non-standard dialects or even in relaxed styles of 
standard-dialect speakers; it is to be assumed that children exposed 
to both possibilities will formulate the rules in question as coDying 
ones, followed by deletion in specific situations. If so, we may 
notice that copying rules resemble the deep structure coreferentiality 
condition in that they also create two necessary coreferents. The 
similarity between chopping and deletion under obligatory identity 
can now be made explicit: in both cases, strict deletion occurs, and 
in both cases an element coreferential with the one acting as 
deletor must exist . 

The possibility of capturing the generalization just stated with 
global rules depends on the constraints placed on global rules. If 
we require that global rules refer to specific stages in a derivation, 
the generalization cannot be captured, for in one instance it is 
necessary to refer to deep structure and in the other to the output 
of copying. If we do not require globaJ. rules to be specific in 
that sense, the generalization can be captured. 

It is noteworthy that there are a great deal more global 
conditions which need to be imposed on island constraints. One 
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such condition is necessary to account for the ill-formedness of 
the sentences in (1.57), which clearly cannot be blamed on some 
movement or deletion transformation. One may be tempted to impose 
a deep structure condition to the effect that the comparee cannot 
occur in an island if the comparator is outside that island (equita- 
tive constructions work in the same way, as the reader can convince 
himself by replacing taller' than with as tall as in (1.56) and 
(1.57)). However, the pertinent condition cannot be a deep structure 
one,^ because (I.63), which has the same deep structure as (l.57c). 



In Grosu (l972a), it was mistakenly claimed that the pertinent 
condition is a deep structureJ. one. 



is acceptable. 

(1.63) John is taller than it is clear that Bill is 
fat. 

Also, the well-formedness of comparatives seems to depend on certain' 
conditions of formal similarity between the compared elements (l 
have not investigated these conditions in detail, as their precise 
statement undoubtedly warrants a separate study; suffice to notice, 
however, that such conditions exist). Thus, consider the difference 
in acceptability between (l.61fa) and (l.65a), as well as between 
d.61lb) and (1.65b), which have been claimed to be related trans- 
formationally. 

(1.61*) a.- It seems that Bill is clever as much as 
it seems that Joe is stupid, 
b. *John is as willing to please as for anyone 
to offend Maiy ii easy. 

(1.65) a. ?*lt seems that Bill is clever as much as 
Joe seems to be stupid, 
b. John is as willing to please as Mary is 
' easy to offend. 

3 (l.65a) shows that an acceptable deep structure becomes bad by a 

transformation (Subject-Raising), and (l.65b) shows that a bad deep 
structure becomes acceptable by a transformation (Tough-Movement). 
It would, seem that the well-formedness of comparatives and equita- 
tives must be determined by. a shallow structure condition (in the • 
sense of some level which precedes post-cycle deletions) requiring 
some kind of formcuL similarity between the compared VP's (the 
condition cannot be a surface one, since formal similarity cannot 
be determined from the surface in sentences like John is as tall a's 
you think Bill is ). 

The shallow structure condition suggested above is not 
sufficient to determine the acceptability of comparatives or 
equitatives as the latter also depends on the semantic compatibility 
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of the compared phrases* Thus, (l.66a) and (l.66b) are all riirht, 
but (1.66c) and (l.66d) are not. 

(1.66) a. John took to his heels as sviftly as 
Bill fought bravely. 

b. Mary is as pretty as Jill is refined. 

c. *^John took to his heels as swiftly as Jill 

is refined. 

d. ^Mary is as pretty as Bill fought bravely. 

As semantic compatibility belongs in deep structure, it appears that 
the determination of the well-formedness of comparatives and equita- 
tives requires at least the following statement: 

(1.6?) Comparatives and equitatives are ill-formed if 
(i) the compared elements are semantically 
incompatible, or (ii) the comparator is 
external and the comparee is internal to some 
_ island configuration in shallow structure-; 

Notice that the two conditions of (1.6?) are disjoined, and 
there is therefore no' motivation for regarding. (1.6?) as a global 
rule, but rather as a set of two independent conditions. Consequently, 
the island constraints must bear a third condition, but a shallow- 
structure rather than global one, namely (l.67ii). 

It should be pointed out that there is an alternative uay of 
accounting for the ungrammaticality of the sentences in (1.57) without 
recourse to (1.67). Thus, given an analysis of comparative and 
equitative constructions along the lines suggested by M. Geis (and 
other generative semanticists) , as in (1.37), one can blame the 
badness of the sentences in (1.57) on a movement transformation. 
For example, if an earlier stage in the derivation of ( 1.57b) is 
*John is tall to an extent which exceeds the ex tent C Bill is Cfat 

l ^^l exten t and an assistant professor] ] / RelatTvization will 
move the rightmost doubly underscored one in violation of the CSC. 

The above account is certainly a possible one; however, I have 
not adopted it because it requires strong ordering assumptions, 
which do not have enough independent support. Specifically, it 
requires that Relativization apply to the degree adverb before the 
lexical insertion rule which replaces to an 'extent which exceeds the 
extent to which by more than ; this lexical insertion rule must itself 
precede the rule of Comparative Deletion which turns John is brighter 
than Bill is into John is brighter than Bill, for this rule can apply 
only when the comparee is immediately embedded to the comparator, as 
shown by the ungranmaticality of ^ John is brifchter than I told you 
that Bill ; finally, this rule must in turn precede Relativization, 
in order to allow I know a man who John is taller than and to block 
* I know a man who John is taller than is . All this could be done 
by making Relativization, Comparative Deletion, and the lexical 
insertion rule cyclic, an ordering for which I do not know of much 
independent justification (in particular, lexical insertion has 
usually been claimed to take place at the end of the cycle). 



It should also be pointed out that if Relativization and 
lexical insertion are made cyclic, Geis* proposal that the island 
character of adverbials be explained by the CNPC collapses , for no 
adverbial would be a complex NP at the stage at vhich Relativlzation 
moves an element out of it. In other words, Geis could not simultane- 
ously maintain his claims that the restrictions on adverbials are 
reducible to the CNPC and that (l>37a) and (l>37c) are derived from 
(l>37b) and (l>37d), respectively (notice that Relativization Rnri 
lexical insertion would have to be cyclic not only in order to account 
for constructions containing comparatives, such as I know a man who 
John is taller than , but also in order to account for constructions 
containing other adverbials, such as this is the woman who John 
committee suicide because of , and this is the girl who John became 
a priest in spite of )> " " 

The various problems mentioned above had led me to adopt the 
view that the sentences in (1.57) are bad because the semantic 
comparative relation goes into an island,, rather than because of the 
movement of a degree adverb out of that island (even though the 
latter analysis is not untenable, provided that we discard the 
claim that, adverbs are islands by the CNPC). It should be noted 
that ascription of island-sensitivity to semantic relations is needed 
independently of comparatives or equitatives; D. James (1972) has 
discovered quite recently that the relation between interjections 
and their scope is also subject to island constraints, and it is 
highly improbable that an analysis involving movement could be 
devised for the cases she considered. 

Another condition to be imposed on island constraints concerns 
the distribution of neutredized elements like anjr or ever. As Ross 
pointed out, such elements, which must be coninanded by some neutralizer 
(e.g., negation), canno.t occur inside islands if the neutralizer is 
outside the island. This restriction is shown in relation to the 
CNPC, CSC, SSC and OAC in (l.68)-(l.7l) respectively. 

(1.68) *I never said that you met a boy who ever harmed 

anyone. 

(1.69) *I never said that Bill is brave and Mary ever 

hcu:ined anyone. 

(1.70) *I never said that for you to ever harm anyone 

is desirable. 

(1.71) *I never said that Bill loves Mary, because she 

ever harmed anyone. 

At the time Ross wrote his thesis, it was assumed that any is 
introduced by a rule of Indefinite-Incorporation (or Some-Any 
Suppletion), and Ross argued that that rule is subject to island 
constraints. This assumption was later questioned hy Robin Lakoff 
(1969), but independently of that fact, it is hard to see what riae 
could be blamed for the occurrence of ever inside islands. Conse- 
quently, the occurrence of ever , and probably any , must be determined 
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by some condition on a level. There is no reason that I can see 
why this condition could not be stated at the same level as (l.67ii), 
that is, shallow structure, where the distribution of neutratlized 
elements in general could be stated. There is, however, the problem 
that the level of shallow structure has not yet been defined 
precisely enough. Thus, certain writers have sug/jested that the 
output of the cycle be regarded as shallow structure, but such a 
definition wouldn't work if Extraposition is a cyclic ride, for the 
well-formedness of sentences containing neutralized elements must 
be determined after sentential subjects have extraposed. Moreover, 
if Question-Formation is post-cyclic, it must also precede shallow 
structure, as shown by (I.72) 

(1.72) I never heard about Bill's raurderine T^^^ \ , 

^ ^anyonej 

(1.73) Whose murdering ] > did I never mention 

to you? 

Shallow structure cannot, however, follow all movement rules* 
In particular, it cannot follow Topicalization (as the acceptability 
of (I.7U) shows), and it cannot follow Pseudo-Clefting. 

(1.7'*) That Bill ever harmed anyone his sister Just 
can't believe. 

With respect to Pseudo-Clefting, there have been proposed at least 
three competing analyses, and I shall argue at great J.ength in 
2.1,3 that only an analysis which involves the "extraction" of the v 
pseudo-cleft element from a relative clause in subject position ^ 
is tenable. Suffice to point out at this stage that the cleft 
constituent must still be within the complex-NP subject in shallow 
structure if the ungrammpiticality of (l.75a) and (l.75b) is to be 
accounted for by (l.67ii). 

(1.75) a. *Bill is taller than what I told you is that 

Bill is fat. 

b. *I never told you that what you fear most is 
that Mary ever harmed anyone. 

Notice that the unacceptability of (l.75a)-(l.75b) has to be accounted for- 
by some condition like the CNPC, for predicates in construction 
with the copula are not islands in general, as shown by (1.76a)- 
(1.76b). 

I 

(1.76) a. This knife is sharper than what you saw 

yesterday is heavy, 
b. I never claimed that what you told me last 
year was ever surprising to anyone. 

I will not pursue the discussion of shallow structure any 
further, and will content myself with having pointed out that ir» all 
probability some post-cyclic niles must precede, while others must 
follow, shallow structure. 
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It is important, however, to ask whether Pseudo-Clefting can 
also follow all the unbounded movement rules, for those cannot touch 
pseudo-cleft constituents, as (1.77)~(l.79) show. 

(1.77) *V/ho do you think that what John wants is to marry? 

(1.78) *The girl who I think what John wants is to marry 

is sicke 

(1.79) *Jill, I think what John wants is to marry; 

The linacceptability of (l.77)-(l.79) would be explained if Question- 
Formation, Relativization and Topicalization preceded Pseudo-Cleft ing, 
for the pseudo-cleft would still be iuside an. island at that point. 
An account along these lines has in fact been attempted by Poss in 
jr, connection with the island properties of clauses moved by Extraposition- 
from-NP. Thus, Ross proposed that the badness of sentences like those 
in (1.60)-(1.82) be explained by ordering Extraposition-from-NP 
after all unbounded movement rules. 

(l.Bo) *I don't think that the man has been hired who 
ever robbed anyone ♦ 

(1.81) *John is taller than a boy just left who was tall. 

(1.82) ^Which hat has a boy just left who was wearing? 

Unfortunately, a similar solution is iraoossible for the sentences 
in (1.77)-(1.79). If (1.81*) is the input to'Pseudo-clefting in the 
derivation of (1.83) (for discussion, see section 2.1.3), it is 
clear that the extraction of to' marry Jill must precede Relativization, 
for it is only after extraction that the copy left behind, namely 
something , becomes identical id.th the head of the relative clause. 

(1.83) What John wants is to marry Jill. 




The thing John wants to marry Jill is 



Now, in order to explain the badness of (1.85) by the CIIPC within 
the Transformational Position, we woiild have to clain that the lower 
occurrence of Jill is relativized while the circled NP is still 
dominated by the boxed !JP. 

(1.85) *Jill, who what John wants is to marry, is my 

cousin. 

But this is impossible whether Relativization is cyclic or post- 
cyclic. If Relativization is cyclic (by which I mean that it is 
triggered as so6n as the head of the relative clause is reached, and 
not that the relative pronoun » hops' one cycle at a time, as 
suggested in Jackendoff (I969)), then Jill must be relativized 
before the circled HP in (l.81|); in other words, Relativization on 
a higher cycle would have to precede Relativization on a lower cycle, 
with Pseudo-Clefting ordered in between. This would mean a genuine 
anti-cyclic ordering of Relativization (not merely its sequential 
application from the top to the bottom oT the tree), and since no 
one has ever offered any evidence in support of such a position, 
we need not take it seriously. If Relativization is 'post-cyclic, 
which means that all instances of Relativization must apply in 
block (i.e., with no rules ordered between various applications 
thereof), and if Pseudo-Clefting precedes one instance of Relativi- . 
zation, it must precede all instances of Relativization; conseq\M»:itly , 
Jill cannot be relativized before the circled HP is pseudo-c.lc:f ted . 

The only way in which (I.78) and (I.65) can be excluded by the 
CNPC is through a fourth global condition, which I state tentatively 
as (1.86). 

(1.86) Elements which cannot be moved because they 

belong in ah island, cannot be moved even 
after they are no longer part of that " 
island. 

(1.86) can accomplish the desired results with respect to (l.77)-(l.79) 
and (1.85), but it cannot capture the intuition that (l.8o)-(l.82) 
is bad for the same reasons as (l.77)-(l.79) or (1.85), namely that 
an element is moved out of a constituent which belonged to an 
island in remote representation. In other "words, the handling of 
(1.80)-(1.82) by extrinsic rule ordering and of (l.77)-(l.79) and 
(1.85) by a global condition seems to say that the two situations 
are unrelated. 

I will suggest in Chapter Four how the similarity between (1.77)- 
(1.79), (1.80)-(1.82) and (1.85) can be captured within the Behavioral 
Position, but for the moment, we may adopt (I.86), which I think is 
observationally correct. We may notice that (1.86)' has at least one 
interesting consequence, namely, that the level of shallow structure 
referred to in (l.67ii) cannot exist. To show this, we must consider 
the propertiea implicitly or explicitly ascribed to that level. The • 
first one (l.07i) is definitional: 
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(1.87) (i) Shallow structure is not earlier than the 
output of the cycle. 

The second property follows from (l.87ii) and o*^r subsequent 
discussion: 

(1.87) (ii) Shallow structure is the level at which the 
correct distribution of neutralized 
elements (i»e., comparees, any, ever ^ 
etc. ) is stated. 

The third property follows from (l.72)-(l.73) : 

(1.87) (iii) Question-Formation precedes shallow 

structure • 

The fourth property follows from the well-formedness of (1.88) 
(in some dialects) and of (I.89) (in all dialects); with respect to 
the latter, its relevance depends on our accepting Alonajian's claim 
(1970) that clefts are derived from pseudo-clefts: 

(1.88) What do you think that what John prefers is? 

(l»89) What is it that you think John prefers? 

(1.87) (iv) Question-Formation follows shallow 
structure . 

Since in both (1.72) -(1.73) and (l*88)-(l.89) Question-Formation 
applies last- or post-cyclically, both applications take place at 
the same stage and no level can exist in between • I conclxide that 
a level with the properties (l.87i-)-(l.87iv) cannot exist. ^ 



%otice that we can use an argument of the same type for 
arguing that the rule which derives clefts from pseudo-clefts is 
not Extraposition, as Akmajian claimed. Indeed, (l«89) shows that 
Question-Formation must follow Akraajian's 'Extraposition'; but 
the difference in acceptabili-cy between (i) and (ii) shows that 
^ Question-Formation must follow Extraposition, or (ii) will be 
marked as ill formed by the SSC. 

(i) *Who is that John loves clear? 
(ii) Who is it clear that John loves? 

"As Akmajian 's claim entails a cyclic ordering of Question-Formation 
and Extraposition within the last- ;or post-cycle, the rule which 
derives clefts from pseudo-clefts cannot be Extraposition. 



The moral of this argument is not necessarily that a level of 
shallow structure cannot exist in principle, but that the distribution 
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of neutralized elements cannot be stated at some well-defined level 
of linguistic structure,-'-^ 



■^•aiotice that even if the distribution of neutralized elements 
were stable in shallow structure, it would still be highly unnatural 
for island constraints to hold at that level for neutralized 
■elements, but at different derivational stages in general. The*=3 
rea^son is that island constraints can certainly not hold in shallow 
structure in general, for this level would then have to possess 
additional contradictory properties such as the following: 

(1.87) (v) Pseudo-Clefting and Relativization follow 
shallow structure, 
(vi) Pseudo-Clefting and Relativization precede 
shallow structure. 

(1.87v) follows from (l.67ii) and the txngraimnaticality of ( 1.71a) 
and (1.71b); (l.87vi) follows from the fact that Pseudo-Clef ting 
and/or Helativization may not touch islands, as shown in (i). 

(i) *vrnat I told you about a boy who saw was the house. 

The point I am trying to make is that it would be highly unnatural 
for a set of nhenomena to constrain sometimes transformations and 
sometimes a given level. 



We have so far noted four situations (in connection with (1.59), 
(1.62), (1.67), and (1.86)) which cannot be .handled within the 
transformational Position. Two more such situations will be discussed 
in this section. 

In his discussion of the CNPC, Ross (1967) pointed out the 
•difference in acceptability between (l.90a) and (l.90b) (for most 
speakers, (l.90b) is intermediate in acceptability between (l.90a) 
and (1.90c), but Ross did not discuss this fact). 

(1.90) a. ^It^s Mary I discussed the claim that Bill 
likes. 

b. ?It*s Mary I made the claim that Bill likes. 

c. It^s Mary I claimed '^that Bill likes. 

Noticing the synonynQr between (l.90b) and (l.90c), Ross suggested 
that the former be derived from the latter by a rule replacing claim 
with make the claim > which would follow all the post-cyclic unbounded 
movement rules, so that no island configuration should exist when 
the unbounded movement occurs. However^ Ross pointed out that he 
could not provide a satisfactory solution within the Trans formationcLL 
Position, since the lexical replacement rule must also precede the 
various unbounded rules ; the reason given was that those rules must 
be preceded by Passive, which must in turn be preceded by the 
lexical replacement rule, in order to allow the derivation of the 



^3k ^ 



c laim that John likes Mary has ailready been m a de by a nunber of 

people > Of course, the ordering paradox noticed by Ross can be handled 

within the Derivational Position with a statement like the following: 

(l.yi) Violations of island constraints are considerably 
reduced in severity if the affected island was 
not an island in xmderlying representation. 

The last problematic case for the^Trans format ional Position I 
wish to discuss concerns the rule of (non-emphatic) Reflexivization* 
Ross claims that Reflexivization is subject to island constraints, 
because it cannot violate the CSC (the other constraints are 
inapplicable, because Reflexivization is both upward and downward 
bounded, i.e., the reflexive and its antecedent must be clause-mates). 
Ross supports his claim with the following examples: 

(1.92) a. *Bill understands Mary and himself. 

b. *Bill and Maiy washed himself. 

c. *Andy pinched Sarah and tickled herself. 

d. *The gun and a description of itself lay 

on the table. 

Notice that the above examples are relevant within the Transformational 
Position only if Coordination-Reduction precedes Reflexivization. If 
Coordination-Reduction follows Reflexivization, then the input to 
the former in the derivations of (1.92a)-(1.92d) vill be (l.93a)- 
(l-93d). 

(1.93) a. Bill understands Mary and Bill understands 

himself. 

b. *Bill washed himself and Mary washed himself. 

c. *Anay pinched Sarah and Andy tickled herself. 

d. *The gun lay on the taole ?uid a description of 

itself lay on the table. 

It can be seen that, given the ordering Reflexivization ^ Coordination' 
Reduction, the b-d sentences in (1.92) are bad Independently of the 
CGC, and the badness of (l.92a) cannot be accounted for by the CSC, 
for the violated island does not yet exist at the stage at which 
Reflexivization takes place. Therefore, Ross* claim depends on the 
ordering Coordination Reduction - Reflexivization being the correct 
one. 

Let us begin by contrasting the ill-formed (1.92a) with the well- 
formed (I.9U). 

(1.9^) John admires Mary and despises himself. 

In order to account for both ( 1.92a) and (1.9^), Ross would be 

forced into the highly unnatural position that some instances of 

Coordination Reduction precede, while others follow, Reflexivization. 

This unnatural position could be avoided by regarding the two parts n 
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Of the CSC (i.e., the one which refers to entire coordinate terms 

to parts of coordinate terns) as essentially 
independent restrictions (in Chapter Three I will argue that this is 
precisely the correct conclusion), and that Refiexivisation is 
subject along with a number of other prononinalization phenomena. 

It remains to decide on the ordering of Reflexivization and 
■ Coordination Reduction. Reflexivization must be cyclic, if the 
grammatical (1.95) is to be generated. 

(1.95) .Mary believes herself to have done harm to 

herself in her sleep. 

Coordination Reduction- must be either cyclic or Tsost-cyclic but • 
not pr«yclic; because it must follow the cyclic" rule of Passive 
as (1.96) suggests. • * 

(1.96) John hit Mary and was hit 'by Bill. 

Making the more con3e.rvative assumption that Coordination Reduction 
IS cyclic, let us consider the derivation of (l,92a). Its source 
must be something like (1.97). 

(1.97) . Bill understands Mary and Bill tuiderstands Bill. 

v?o?J/*^M*c^o'f if °" *he first cycle, 

yielding (1.93a); Coordination Reduction cannot anply on the first 
cycle, because its structural description is not met, and must there- 
fore apply on the second cycle and to the output of Refl«.vivta«f.4nr, 

Consequently, the unacceptability of ( 1.92a) cannot be blamed on 

the CSC within the Transformational Position, and a sixth r^obal 
condition like (I.98) seems to be call-id for. 

(1.98) A rule subject to island constraints is blocked 

even when it affects a structure which is not 
internal to an island, if that structure 
becomes internal to an island after the 
application of the rule in question. 

We have considered six non-transformational conditions on 
island constraints, namely (1.59), (I.62), (I.67), (l.Sb'), (1.91) 
and (1.98), and have seen them to-be necessary for observational 
adequacy purposes. There is no reason to believe that the list is 
closed, but even if it is, there are sufficient grounds for rejecting 
the Transformational Position as observationally "inadeouate. 

VJith respect to the Derivational Position, we have seen that it 
was observationaUy adequate for the cases considered, and we may 
credit It with observational adequacy in handling, other similar 
cases, if such should arise. On the other hand, there is no reason ' 
for believing that the Derivational Po5?ition can do more than give an 
observationally adequate accoxint of the facts. The Derivational 
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Position cotild be claimed to capture an important generalization 
if the six global conditions I have listed* coiild be shown to form 
a natural class. However, there is no way that I can see in which 
this can* be done. (1.86)^ and (l.9l) might suggest a generalization 
like'»a configuration counts as an island if and only if it has that 
status in underlying representation', but this biconditional state- 
ment is falsified both ways by (1.62) and (1.98); (1.62) shows that 
the violation of a configuration with island status in underlying 
representation becomes immaterial due to something which happens at 
a later stage, and (1.98) shows that a configuration with no* island 
status in underlying representation can impose island constraints 
due to something that happens later on. Until someone can show that 
the list of global conditions given above forms a natureuL class, we 
can regard the Derivational Position as descriptively and explana- 
torily inadequate, since the statement of the above global conditions 
on island constraints does make the implicit claim that these 
conditions form a natural class. 

In Chapter Four, I will argue that tne phenomena described in 
this section by global conditions follow in a natural way' from 
independently supported assumptions made by the Behavioral Position. 

1.2.7. Before concluding our review of Ross' dissertation, it 
is worthwhile to look at the most important questions Ross considered 
his thesis had raised without answering: 

(1.99) a. Why should rules which adjoin terms to the 

right side of a variable be upward bounded, 
and not those which adjoin terms to be 
the left of a variable? 

b. Why should it be that chopping rules, 

feature-changing rules and unidirectional 
deletion rules share the property of 
being subject to the constraints, to the 
exclusion of other rules? 

c. Why should there be a difference between 

unidirectional and bidirectional pro- 
nominalization? 

d. Why should complex NP's, coordinate nodes, 

sentential subject clauses and NP's on the 
left branches of larger NP's all function 
the same in defining islands? 

e. Can islands be shown to behave like psycho- 

linguistic entities? 

A large part of the remainder of this thesis will be devoted 
to arguing that the answer to (l.99e) is yes. With respect to 
(l.99a)-(1.99d) , I believe they rest on mistfdcen assumptions and 
should be reformulated before they can be answered. I will discuss 
( 1.99a) in 2.1.3 and show that the pertinent constrednt is much 
weaker than Ross though. Concel-ning (l.99b) and (l.99c), I have 
already shown that xinidirectional and bidirectional pronominalization 
are irrelevant notions with respect to island constraints; moreover, 
I have also pointed out that the notion ^feature-changing rule' 
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shpiild probably be rephrased as 'neutralization', to cover at least 
the comparator-comparee relation, which has nothing to do with 
feature-changing. Finally, I shall argue in Chapter Three and in 
Chapter Four that only conrolex NP's, coordinate nodes, and sentences 
or verb phrases modified by optional adverbials function the same in 
defining islands, as they are the only configurations the constraints 
on which are descriptively collapsible and therefore capable of 
receiving the same explanation , Once Ross' questions have been 
appropriately reformulated, the answer to them will essentially 
consist of the answer to (l,99f), 

(1.99) f. Kow do islands behave as psycholinguistic 

entities? 

1.3. There are two more proposals vithin the Transformational . 
Position which we shall consider in this chapter. The first 
represents a reaction of Chomsky's to Ross* criticism of the A-over-A 
principle; Chomsky attempts to defend his principle within the 
framework of his Extended Standard Theory, The somewhat .-lore elegant 
formulation of the principle given in this paper is reproduced 
below as (l.lOO), although the latter is, as far as I can see, a 
notational variant of (l.l*). 

(1.100) If a transformation applies to a structure of 

the form 

Coi .... ....3 ....3 

where a is a recursive node, then it must 
be so^ interpreted as to apply to the maximal 
phrase of the type A. 

1.3.1. Chomsky writes that "it would be a velcome result" if 
a "careful formulation of the A-over-A condition", together with a 
set of conditions to be examined below, would turn out to be capable 
of adequately handling the range of examples earlier treated as 
conditions on transformations, for the A-over-A principle seems to 
be an intuitively nature^ one, while "such conditions as the Complex 
No\m Phrase Constraint seem quite ad hoc".- And indeed, it is not 
difficult to assent to the claim that if Ross' treatment represented 
a step forward over the A-over-A principle in terms of observational 
adequacy, it looked very much like a step backwards in terms of 
descriptive adequacy. However, as no account can be descriptively 
adequate without also being observationally adequate, it will be 
interesting to consider in some detail the ways in which Chomsky 
proposes to make his analysis observationally adequate. 

Earlier, I mentioned two kinds of cases in which the A-over-A 
principle turned out to be too strong; the two kinds were illustrated 
by the examples (I.6) and (1.7), (l.lO), (1.13) respectively, which 
I reproduce below for convenience: 

(1.6) You approve of my seeing someone. 

(1.7) You saw the picture of the master of the house. 
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(l.lO) I believe that John claimed that Mary insinuated 
that Bill was guilty of murder. 

(1.13) John decided to try to begin to write a book. 

In connection with (1.6), where the NP someone can be wrenched 
out of the containing HP m y seeinjg someone , Chomsky has in fact no 
solution. He proposes to weaken the A-over-A principle from the 
status of a universal constraint to that of a part of an evaluation 
measure on the complexity of grammars . Thus , who would yo.u approve 
of my seeing is merely a 'marked' form, rather than an ungrammatical 
one, since the violation of the A-over-A principle does no longer 
necessarily result in ungrammaticality. What this account leaves 
unexplained is why the above marked form should be perfectly 
acceptable, while other instances of violation of the principle 
should be totally unacceptable. 

In connection with (l.7),> (l.lO) and (1.13), where any of the 
lower ImT's, S's, or VP's can be fronted, Chomsky claims that the A-- 
over-A principle is not in fact involved. His explanation runs as 
follows: there are, among other rules, the cyclic rules of wh-placement 
and of wh -movement ; the rule of wh-movenent refers only to constituents 
that carry a wh (in Chomsky's system, topicalized elements also carry 
a w;h-like feature); therefore, the chopping rule of wh-movement is 
unambiguous, and the A-over-A principle does not need to be invoked. 
However, as Chomsky nowhere says that the A-over-A principle constrains 
only chopping transformations, it is not at all clear why it allows 
the placement of jrti on the lower A's. Wh-placement may be regarded 
as a feature-changing transformation, but I think it is more correctly 
described as the overt manifestation of a neutralization phenomenon, 
of the type discussed already; in the case of relative wh-words , the 
neutralization involves conferentiality, in the case of interrogative 
wh-words, it involves the 'target-^ of a question-verb (in the 
performative analysis, this description covers direct r|.uestions as 
well).^ Such neutralizations are subject to island constraints, as 
we saw above, 'V wh-neutralization is no exception. To see that, 
we must consider a situation in which wh-words cannot move; this etrises 
when there are more wh-interrogatives than question verbs, as in (l.lOl). 
♦» 

(l.lOl) a. Someone went to Paris and to some other place, 
b. *Who went to Paris and to which other place? 

The fact that (l.lOlb) cannot be derived from (l.lOla) indicates 
that ^^-neutralization must be cons^^rained by the A-over-A Principle 
or some equivalent device. 

1.3.2. By denying that wh->neutralization is subject to the 
A-over-A Principle, Chomsky must impose a whole new list of conditions 
on the rules which move wh-like words; the applicability of the A- 
over-A Principle is, of course, severely restricted by this decision. 
On the other hand, the A-over-A Principle becomes applicable in a 
new range of cases , which did not require any meta-grammatical 
constraints in Ross' framework, due to some rather radical changes 
in Chomsky's theoretical framework. For example, due to the elimination 
of the Subject-'Raising rule and to the definition of the structural 
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index .of the passive in terms of categories rather than of relations, 
the A--over--A condition becomes important in preventing ungrairmiatical' 
outputs of the Passive transformation. 

In order to block the movement of one term of a coordinate node, 
Chomsky resorts to a convention proposed by Dougherty to the effect 
that feature complexes are not assigned to coordinate terms directly, 
but rather to the coordinate node first, after which they 'percolate* 
into the coordinate terms. Since a feature like wh has to be assi/3:ned 
both to coordinate NP-node and to the NP--nodes that it immediately" 
dominates, the A-over-A principle effectively block the move-- 

ment of one of the latter. It is not clear, however, how the move-- 
ment of ^ element of a coordinate term will be blocked. If the 
coordinate node is an S, it will be rather artificial to assign it 
a wh, in order that some IIP .inside the coordinate S's be eventually 
marked with this feature. Moreover, it will be necessary to allow 
the •across-the-board* movement of elements in violation of the 
A-over-A principle. 

It is not very clear how the movement of the head of a 
relative clause can be prevented, so long as neutralization manifested 
by wh-placement is not subject to the A-over-A condition. Presumably, 
the percolation-convention would have to be extended to complex NP*s, 
requiring that the head can only receive a wh if the higher NP node 
received one as well. 

With respect to the* Left Branch Condition, Chomsky suggests in 
note 10 that it be made part of the larger condition which blocks 
the movement of • specifiers of NP's and AP's (determiners and pre- 
adjectival adverbial modifiers respectively). This more general 
formulation, as well as the separation of the freezing of specifiers 
and of Complex-NP-heads (which were both collapsed under the LBC in 
Ross' treatment), seems to me essentially correct; in fact, I will 
argue that it was wrong to blame the freezing of complex NP heads 
on the LBC. 

1.3.3. A large number of restrictions are taken care of by 
the following rather complex constraint: 

. (1.102) No rule can involve X, Y in 

...X..SQi...Z... - WYV . . . D . . . 

where (i) Z is the subject of WYV and is not 

controlled by a category containing 
X 

is a subject phrase properly 
containing 

is in COM? and X is not in COMP 
is not in COMP and a is a tensed S 
is in COMP and is not in an 
adjacent cycle. 

1.3.3.1. (I.102i) concerns a number of cases which were not 
discussed in Ross* dissertation, some of which, Chomsky claims, have 
the ultimate effect of redutlng ambiguity in language. These cases 
will be briefly considered in Chapter IVo. 
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1.3.3.2. (I.102ii) is in effect equivalent to Ross* Sentential 
Subject Constraint, except that it does not allow for the optionality 
of Pied-Piping in subject position^ and thus rules out the grammatical 
(1.103b). 

(1.103) a. A picture of John hangs in the dining-room, 
b. It*s John who a picture of hangs in the 
dining-room. 

1.3.3.3. (I.102iii) presupposes Bresnan's Complementizer 
Substitution UniversguL . In this view, all sentences in all languages 
are generated in the base with a node COMpH (for complementizer ). 



Notice that this proposfiuL comes very close to a tacit 
acceptance of the performative analysis, since it is hard to see 
what other interpretation could be given to the fact that main 
clauses are introduced by COM?. 



which is necessarily lexicfidly empty. This node may appear to the 
right or to the left of sentences, and it is only in the latter case 
that transformations can insert lexical material into COM?. The 
lexical material that can be so inserted consists by and large of 
wh-like words (i.e., relativized, questioned or topicalized 
constituents in various languages). The condition that COMP-substitution 
transformations cannot operate if the COM? is clause-final is in fact 
equivfident to Ross* Right Roof Constraint. If COMP is not filled by 
a wh-phrase, it is optionally filled by that (incidentally. Dean (1967) 
pointed out that verbs like quip , croak , roar , etc., require a that 
complementizer). What (l.l02iii) does is ensure that the rxiles that 
Ross views as unbounded and that Chomsky regards as proceeding by 
cyclic 'hopping* will only adjoin a chopped constituent to the left 
of sentences. 

1.3.3.'+. (l.l02iv) prevents elements from escaping from tensed 
interrogative clauses. Thus, in both (l.lOU) and (1.105) Y (in these 
instances, what ) cannot be in COMP, for the COMP position is filled 
by where and how respectively, and the movement of what is blocked 
on the inner cycle. On the outer cycle, however, what can escape in 
(1.105), but not in (l.lOU), for, in the latter, it originates in a 
tensed clause. 

(LlOU) *What did he wonder where John put? 

(1.105) What crimes does the FBI know how to solve? 

In notes (25) and (26), Chomsky admits that (l.l02iv) is too strong 
for many speakers in blocking movements out of all tensed interro- 
gative clauses (some speakers accept What crimes did he wonder how 
they solve d) and that (l.l02i) is also too strong, as it stetrs (1»105). 

From, the way in which (1.102) is formulated, it is not clear 
whether or not the set (i-v) is intended as an exhaustive list of 
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conditions; that is, it is not clear whether the where that nrecedes 
(I.102i) is to be interpreted as an if and only if or merely as an 
if* Given the fact that (1.102) is the last of a number of 
reformulations, we can assume that an if and only if- statement was 
attempted, and infer from (l*102ii) and (l*102iv) that if X is in 
COMT and Y is in COMP, the result will be grammatical. In this 
connection, I should like to point out a rather interesting counter- 
example from Rumanian. 

In Rumanian, it is possible to move more than one questioned 
constituent in COMP^iiosition' in the same, clause. Thus, the following 
sentences are all grammatical: ^ 

(1.106) Cine pe cine a intinit? 
Who whom met? 

"iVho met whom?" ^ 

(1.107) Nu §tiu cine unde a plecat. • 
I don*t know who where went, 
"l don*t know who went where." 

(1.108) Ti-ara spus deja cine pe cine cind a ucis* 

I have told you already who whom when killed* 
"I have already told you who killed whom when." 

Apparently, the base rules of Rumanian generate more than one 
clauses-initial COMP-node. One might attempt to maintain the one-- 
COMP-per-S hypothesis by proposing that the various 'wh-words* are 
in effect coordinated and that the coordinate node is inserted under 
the COiMP-node. Such a hypothesis would be, however, disproved bv 
(1.109) and (1.110). 

(1.109) *Nu stiu cine cind unde a murit. 

I don^t know who where when died, 
"l don^t know who died where when." 

(1.110) IIu stiu cine unde cind a murit. 

I don^t know who where and when died. 

Apparently, the COMP-nodes of Rumanian, have to be marked for the 
category they can accommodate, and there is only one COMP for 
adverbials. Thus, (l.llO) is grammatical, for only one (coordinate) 
adverbial node has been inserted, while (I.I09) is ungrammatical, 
because one of the adverbs has failed to find an empty COMP-node. 
This strongly suggests that there is more than one COMP-node in 
(1.106)-(1.108) , and that the various questioned constituents in 
these sentences are not coordinated. 

Another fact which suggests that the equivalents of wh-words in 
( 1.106)- (1.108) are not coordinated is the existence of the following 
minimal pair:!^ 



I am indebted to my wife, Mariana Grosu, for this observation. 



(1.111) a. Nu stiiT^c ine -ne cine a prins. 

b. Nu §tiu* ^ine. si pe cine , a prins. 

In (1.111b), the wh-vords are overtly coordinated, and the a- and b- 
sentences are not quite identical in meaning, (l.llla) translates 
rouchly as (1.112a), vhile (l.lllb) translates roughly as ( 1.112b); 
in the a^-sentences , the speaker asserts that he knows one i*act, 
while in the b-sentences he^asserts that he know3 two facts. 

(1.112) a. I don*t know who caught whom. 

b. I don*t know whoi cauf^ht someonej , and I 
don't know whorij he^ caught. 

As pointed out above, it would follow from Chomsky •s principle 
that a wh-phrase that has moved into COMP-position can move upwards. 
But this is not the case for Rumanian, as the movement of any of the 
\^-words in (1.108) resiilts in total ungrammaticality ; this can be 
seen in (1.113)-(1.115 ) (the last is grammatical, but only if cind 
Vhen* originates in the matrix sentence). 

(1.113) ^Cine ti-am spus pe cine cind a ucis? 

\n\o did I tell you whom when killed? 

(l.lll*) *Pe cine ti-am spus cine cind^a ucis? 

did I tell you who when killed?^ 

(1.115) Ctnd ti-am spus cine pe cine a ucis? 

When did I tell you who whom killed? 

What these facts suggest is that the explanation of the 
restrictions on chopping out of interrogative clauses in terms of 
wh-hopping is mistaken. In addition, it still remains to give an 
observationally adequate account of those cases in which interrogative 
clauses function as islands, since neither Chomsky nor Ross have done 
anything of the kind. Perhaps, the solution -to this difficult problem 
shoxild be sought not only in formal features, like the presence or 
absence of tense in the interrogative clause, but also in the 
semantics of each interrogative word. The matter remains open, for 
the time being. 

Since Chomsky requires that COMP dominate a null lexical string 
in the base, I assume that he would not want to claim that 
subordinating conjunctions like although , unless , therefore , etc., 
occupy COMP-position at any time. Notice, however, that if such 
conjunctions are construed as complementizers, and if the theory is 
modified to allow their generation in the base in such a position, 
then (I.102iv) is powerful enough to account for what I called 
earlier the Optional Adverbial Constraint. Additional specifications 
would, however, be necessary, since adverbials seem to disallow the 
migration of their elements even when they are not tensed, and even 
when they are not clauses. 

1.3.3»5» Finally, (l.l02v) is meant to cover some cases which 
fall under the Complex NP Constraint. Indeed, although both S and NP 
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are cyclic nodes, it is S alone that has a COMP, and some wh-word 
inside a clause in apposition to a sentential noun can only move 
in the COMP of that clause; it will be unable to move on the next 
cycle, since the higher NP phrase has no COMP, and (l.l02v) prevents 

from 'skipping a cycle* (this condition is unnecessary to prevent 
movements out of relative clauses, since those have a wh-word in 
COMP position, and chopping is thus blocked one cycle earlier). 

If (l.l02v) is adequatie for complex HP's, it fails again by 
being too strong in the case of embedded interrogatives • Indeec^^ 
it rules out the entirely acceptable (I.II6). 

(1.116) This is the book which I am not sure whether 

(or not) I told you how to bind. 

This additional example strengthens the case for not attempting 
to nandie interrogative-clause islands in purely formal terms. 

1.3.^. In conclusion, it appears that Chomsky's analysis of ^ 
the phenomena considered by Ross does not go. very far in providing 
a natural alternat ive* Chomsky rightly criticizes Ross for proposing 
a list of ad hoc constraints, but it is difficult to see why his 
(I.102ii)-(l.l02v) chould not be subject to the same criticism. 
Also, the A-over-A Principle, which did seem to have an air of 
naturalness e' out itself in Chomsky's initial treatment, .does not 
live up to r »ctations in Chomsky ■ (1971 ) , owing to the large number 
of cases in .<tich it makes wrong predictions, or in which it makes 
correct predictions for the wrong reasons (e.g., the blocking of 
movements of elements of a coordinate term Just in case the element 
to be moved and the coordinate node happen to belong to the same 
category). I believe we must conclude that Chomsky's 1971 proposal 
comes no closer than Ross* I967 one to providing a natural solution 
to the phenomena at issue. 

1.^. The last attempt to arrive at a descriptively and explana- 
torily more adequate account of. island constraints within the Trans- 
formational Position which we shall examine in this Chapter is Ross* 
Island Constraint (or the IC ) » orally proposed in series of 
lectures at The Ohio State University (spring, 1971). 

(1.117) The I£: If some constituent X of category A 

immediately dominates some constituent Y, 
also of category A, then no constituent Z 
of any category type whatever , which is 
dominated (not necessarily immediately) by 
X can be moved from under the domination 
of X. 

The IC revives the A-over-A Principle, and attempts to avoid its 
observational inadequacies by appropriately weakening and 
strengthening it; this is done by the first and second underscored 
specifications respectively. 

Although I consider the specification of immediate domination 
to be a valuable insight (which I shall incorporate into my counter 
proposal of Chapter Three), I think that the TC must be rejected 



because of at least the following observational deficiencies: 

(1.118) a. It wrongly predicts that clauses moved by- 

Extraposition (and attached to the first 
S-node up) are islands, as can be seen 
in (1.119). 

(1.119) ^"ttio is it possible that Mary loves? 

(1.118) b. It wrongly predicts that Extraposition-from- 
ITP is an impossible rule, 

c. It wrongly predicts that Extraposition-of~ 

PP is an impossible rule. 

d. If prepositional pharses are regarded as 

NP's, it wrongly predicts that prepositions 
cannot be stranded. 
3. It wrongly predicts that nominal complements 
of NP*s cannot be moved, that is, that 
(l.l20a) and (l.l20b) are \mgrannnatical. 

(1.120) a. Which house did I show you a picture of? 
b. Of which cars were the hoods damaged by 

the explosion? 

(1.118) f. If complementizers are Chomsky-adjoined to 
^ the clauses they introduce, it wrongly 

predicts that (I.I21) is ungrammatical. 

(1.121) Who did you claim that John saw? 

(1.118) g. It wrongly predicts that elements of a main 
"clause modified by an adverbial clause 
cannot move, that is, that (1.122) is 
ungrammatical . 



(1.122) The woman who John dislikes, because he suspects 
her of deceit, is seriously ill. 

The applicability of the IC to such structures follows trom the claim 
made in Ross (1967) that they have essentieJ-ly the surface structure 
in (1.123). 



(1.123) 




Ross defended the structure in (1.123) with two arguments: (i) 
the natural place to pause in {l.l2k) is before because , which 
suggests that this is where the major structural boundary lies. 
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(1.121*) John hit Bill because he was furious. 

(ii) Extraposition cannot move a clause around the adverbial, as 
seen in (1.125), a fact which would follow from a structure like 
(1.123) <md the PRC. 

(1.125) a. That Mary is sick is funny because no one 

has been sick for yenrs. 

b. It is funny that Mary is sick because no 

one has been sick for years. 

c. *It is funny because no one has been sick 

for years that Mary is sick. 

The two above argiunents are not entirely conclusive, for the 
major bre<ak in a phrase does not necessarily allow pause (e.g., one 
does not usually pause between subject and verb nhrase), and the 
facts of Extraposition do not work in the same way for non-clausal 
adverbs (the reason is, I believe, that the RRC is an incorrect 
principle, as I shall argue in 2.1.3), as can be seen in (1.126). 

(1.126) a. That John was sick was announced yesterday 

at noon. 

b. It was announced that John was sick yesterday 

at noon. 

c. It was announced yesterday at noon that 

John was sick. 

As it is rather implausible that adverbs are differently adjoined 
to higher nodes depending on whether they are clausal or not, Ross* 
second argument is inconclusive. But there are better arguments 
which can be adduced to support his position. Thus, (iii) As two 
adverbs of the same kind cannot occur in the same clause, we must 
assume that the second adverbial in (l.l27a) and (l.l27b) belongs 
to a higher clause than the main one. 

(1.127) a. John struck his kid because he had been 

naughty, because Mary asked him to do so. 
b. John went out to buy some whiskey, to 
please his fiancee. 

(iv) Negative polarity items like anjr or ever are tolerable inside 
adverbs, provided that the latter are not preceded by pause, as 
seen in (1.128). 

(1.128) a. John wasn't arrested because he had ever 

killed anyone (he was nabbed for stronfr- 
armed robbery), 
b. *.Iohn wen't arrested, because he had ever 
killed anyone. 

The difference in acceptability between (l.l28a) and (l.l28b) follows 
quite naturally if we assume that the latter only has the structure 
in (1.123), since the negative polarity items wou3d be commanded by 
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negation in (l.l28a), but not in (l.l28b). 

I regard the argiuuents (iii) and (iv) above as sufficient for 
concluding that sentence-modifying adverbials have the structure in 
(1.123), and that (1.122) is a. counterexample to the IC. 

(1.118) h. The IC wrongly predicts that adverbs cannot 
be clefted, and this is not in general 
true, as shown in (1.129). 

(1.129) a. It*s because he loves her that John beats 
his wife. 

b. It was (in order) to gain Mary^s confidence 

' that John pretended not know that she was 
rich. 

c. It's since he began the research on his 

dissertation that Bill has gone nuts. 

d. It's after the battle that scores are 

settled • 

e. It's before he sirrived that she died.-'-^ 



TO 

■^-'Many other adverbial types can be clefted, such as the ones 
introduced by instrumental with , before , after , temporal since , 
etc. However, although, as, for , causative since , and perhaps 
others, cannot be clefted; the latter three are interesting in that 
they seem to be logically synonymous with because , but behave 
differently with respect to clefting, as (i) shows. 

(i) a. Iv's because he had no money that John 
dropped out of school. 
fas 

b. <*It's A for > he had no money that John 
VsinceJ 
dropped out of school. 

iM. Geis pointed out to me that the adverbs in (ib), unlike the one 
in (ia), disallow the chopping of elements of the main clause — as 
seen in (ii) — , which constitutes an apparent counterexample to my 
claim (3.2.ii) in Chapter Three, to the effect that movements of 
elements of the main clause are free in principle. 

(ii) a. V/ho did John Jilt because she had no money? 

r-as 1 

b. *V/ho did John jilt < for > she had no money? 

since J 

A third distinguishing property is the ability to occur in 
embedded clauses, as shown in (iii). 

(iii) a. I believe CJohn left the country because he 
was xinhappy3. |^as ^ 

b. *I believe CJohn left the country -{ for V he 
was unhappyO. 1 since^ 
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Thn. ^fw^^*- "^^^if^e^ishing property is ambiguity under negation, 
(ivb) is not^^ ambiguous, as pointed out in Lakoff (1970), while 

(iv) a. John doesn't beat his wife because he loves her. 

b. John doesn't beat his wife \ for I he loves her. 

(_ since j 

A fifth distinguishing property was pointed out to me bv J 
Hutcheson: while (va) can be a well-formed answer to why does John 
beat his wife? , (vb) cannot. ~ 

(v) a. Because he loves hex'. 

/As -) 
b. »-S For > he loves her. 
(_Since J 

A complete study of adverbs is beyond the scope of this 
dissertation, but it seems to. me that "the data presented so far • 
suggest that as-, for-, and since-, clauses must be comments of t h,. 
speaKe r, while this is not necessary for because- clauses ■ if this ' 
IS correct, we have a fairly straightforward explanation for the f've 
facts above. 

?hus, (iiib) is bad, because a comment of the speiiker must bf/ 
directly dominated by the top S-nOde. 

^^Z^^ ambiguous because the be cause- clause can be dominated 
frn ^' ^" ''^^^^ commands the negation, or by 
.the VP-node of the main clause, in which case negation commands it; 
as the latter situation could not arise for as-, for-, or since- 
clauses, (ivb) is unambiguous. 

With respect to the unacceptability of (iib), a number of 
writers have proposed that unbounded movement rules , such as 
Question-Movement, attach an element through Chomsky-adjunction 
U.e., by creating a superordinate node of the same kind :a8 the one 
to which the element is attached) rather than by sister-adjunction. 
Very little evidence has been given in favor of this position, as 
far as I know, but if who is indeed Chomsky-adjoined to the top 
fa-node, the unacceptability of (iib) will follow from the fact that 
the adverbial clauses are no longer dominated by the top S-iiode; thus, 
the badness of iib), supported by the conclusions that we drew from 
the badness of (iib) and (ivb), may be regarded as evidence that 
unbounded movement rules perform Chomsky-adjunction. 

The badness of (ib) and (vb) are, I believe, different mani- 
festations of the same fact. Thus, it has often been pointed out 
that clefted (and pseudo-clefted) constituents, as well as 
(appropriate) answers to questions, represent the main a&ser;ion of 
a sentence, or, as certain writers put it, the new infonnation which 
a sentence conveys; this is also mirrored in the fact that cleft and 
pseudo-cleft constituents as well as answers to questions (and 
question-words, for that matter), must always bear stress. But 
as-, for-, and since-, clauses constitute (parentheticsl) comments of 



the speaker on the proposition expressed in the main clause, and 
consequently cannot be the main assertion of the entire sentence; 
hence, they cannot be cleft constituents or answers to questions^ 
and of course, there can be no quej?tion-word applicable to such 
constituents (that such clauses cannot be pseudo-clefted is shown 
in (vi), where the a-sentence is possible for some speakers, while 
the b-cne is out for everyone). 

(vi) a. Why I didn't want to see you was because 
you had always been nasty. 

(as 
for 
since 

The import of the facts discussed in this note is that very 
slight semantic distinctions can have notable syntactic consequences. 
The same was argued in note 3 to this chapter, where it vras pointed 
out t^hat exceed and more than , although apparently logically 
equivalent, cannot have the same semantic representation. Such 
cases undoubtedly deserve further study ^ (for a discussion of 
related problems, see Shopen (1972)). 



(1.118) i. The IC falsely predicts that adverbials 

cannot be preposed from under the dominance 
of the immediately higher S-node (the 
circled one in (1.123), and this is 
contradicted by (l.l30b), which is derived 
from (I.130a). 

(1.130) a. I wonder whether I could talk to you, in 
case you have a spare minute* 
b. In case you have a spare minute, I wonder 
whether I could talk to you. 

(1.118) J. The IC .falsely predicts that all coordinate 
structures are frozen to the same degree. 
In Ross (1967), the weaker claim that 
only symmetric coordination is subject to ^ 
the CSC is made, but it will be shown in 
detail in Chapter Three that the situation 
is much more complicated than that; in 
fact, different types of coordination are 
subject to different constraints. Ross 
failed to consider all the pertinent cases, 
and even the ones he considered are not 
constrained exactly as he thought, 
k. The IC (as well as the CSC) falsely predicts 
that across-the-board movements out of 
coordinations cannot occur, that is, that 
(1.131) i^ ungramnatical. 



(1.131) It*s potatoes that Bill likes and Jill dislikes- 
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I take it that the objections (I.ll8a)-(l.ll8k) are suff^o<A«+ 
to show that the IC is^ not a tenable proposal, even tSJugh I Jhirf 
II i::.\llTZrll T^'- otherXrVaJits 

as ^ P"^*"* * different- propose which 

ll Zi leL^±tT^r ^'^^ counterexamples tha? viciSe .he ' 
IC, and lends itself to a natural explanation. My criticism of th« 
various proposals I have discussed was not >neant to ^eliUle ?he« 
On the contrary, I regard then as highly interesting aSe»its to 
»ake contributions to the study of sm^.l^rth^facfthL ° 
empirical proposals exhibit flaws should come as no surprise Tn 

Tl'J ^^"^r °^ P'^P^^'^. «^<^n if r^kSg higher 

at various levels of adequacy than tliose I have relected wiS 
Itself turn out to be inadequate in a number of wjs ' 



CHAPTER II 
PERCEPTUAL PRINCIPLES AND GRAMT^IAR 



2'.0^0. This chapter is essentially concerned with providing 
linguistic illustrai;ions of the applicability of the three kinds of 
perceptual principles mentioned in the Introduction, namely, the 
principles of erroneous closure, the principles of perceptuail 
conflict, and the principles of interrupted behavior; these three 
kinds of principles will be discussed in sections 2.1.1, 2.1.2, and 
2.1.3 respectively (the last will be introduced in 2.1.2). 

The theory of perceptual strategies summarized in Bever (1970) 
is briefly outlined in 2.0.1. The last major section of this chapter, 
2.2, is concerned vith various possible minimal distance principles, 
and their failure (as far as I have been able to ascertain) to 
affect grammars in significant ways. 

2.0.1. In earlier writings within the framework of transformational 
grammar, the distinction between "ungrammatical" sentences and sentences 
"unacceptable for performance reasons" was presented as a hard and 
fast one. Most recently, various writers have questioned the validity 



*4?here has also been a great deal of discussion on the treatment 
of semantic ill-formedness, i.e., whether the latter should be handled 
as a syntactic or extrasyntactic phenomenon, but this is largely 
irrelevant to the issues we shall be concerned with. 



of this absolute distinction. In particular, T. G. Bever has 
claimed, in a number of articles, that the internal mechanisms of 
speech production, of speech perception, and of prediction of the 
acceptability of potential sentences are partially independent and 
mutually influence one another. The part of his hypothesis that we 
shall be essentially concerned with is the claim that the strategies 
employed in speech perception may constrain the acceptability, and 
sometimes even the grammaticality , of sentences, and that the out- 
right rejection of certp.in constructions by native informants may be 
explained in a much more illuminating way by considering possible 
perceptual causes than by seeking a purely formal analysis. ^ 



I wish to stess that my adoption — as a working hypothesis — 
of Bever •s claim that feata:j'es of perception may affect the accept- 
ability of sentences should not be construed as an acceptance of 
every aspect of his theory of interacting systems. 

In particular, I disagree with his statement that "transfoiuiations 
are irrelevant to most ongoing speech behavior. This statement 
rests on the fairly well supported notion that the production of a 
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sentence does not simulate its f-enerition by a transformational 
grammar, and that its understanding does not simulate a reversal of 
the grammatically pertinent rules. The most obvious argument 
against equating grammatical rules with real-time processes is that 
the rules of grammar operate on the entire string, while a sentence 
begins to be interpreted as it begins to be heard and need not exist 
in its entirety in the head of the speaker before it is pronounced. 

Notice, however, that this only shows that transformati ons are 
probably not used as real-time .processes, but not that they are not 
used at all^n^behavior. In fact, I find it hard to conceive of a 
person producing or understanding certain sentences without using his 
knowledge about transformational relations , or about other features 
of his language. Indeed, how could one say or understand 

(i) although he is poor, John married a rich girl , or 
(ii) Bill killed his wife and was in turn killed by 
his mistress 

without consciously knowing that poor and rich are antonyms j or that 
the active and the passive are converse relations? Conversely, how 
could one construct a pun without being aware of ambiguities? It 
seems to me that everything a person knows about his language may be 
actively utilized in production or perception, even though this 
utilization need not take the form of grammar-simulation. 

Bever's statement that "competence is performance" has drawn 
sharp criticism. On the face of it, this statement is not Just hard 
to believe, but also in flagrant contrauiction with Bever's other 
claim that transformations are intuitions not used in actual speoch 
behavior, which implies that one's knowledge about language and one's 
linguistic behavior are at least partially distinct. I believe, 
however, that these difficulties are due to Bever's failure to define 
his notion of competence carefully enough. Indeed, Bever uses 
'competence" in the special sense of "what -a speaker consciously 
knows about his language," and claims that a speaker's intuitions 
are limited to acceptability judgments, but shed no light on the 
sources of unacceptabilities. Thus, if competence is defined as the 
sum of a speaker's intuitions and if expressing intuitions is a form 
if performance, it is clear that the distinction between performance 
and competence becomes a trivial one, and one may claim that 
competence is part of performance, or that "competence is, performance" 
(which does not mean that performance is competence). 



Bever refers to a number of experiments which suggest that the 
various mechanisms which are integrated in the adult are acquired 
independently by the child. Thus, children seem to go through three 
stages during their linguistic development. Very young children 
appear to depend on "basic linguistic capacities," such as the belief 
that words refer to objects and actions in the world, or that there 
are functional internal relations between parts of sentences, such 
that they express the roles of the various actants. Children around 
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the age of seven begin to be able to appreciate the regularity of 
the relation between actives and passives, which suggests that they 
have begun to acquire a grammar, i.e., an abstract system which 
enables them to make sophisticated predictive judgments about 
sentences. However, children between the ages of two and six appear 
to rely rather heavily on perceptual strategies like (2,1) and (2.2). 3 



3fphese are, respectively, the principles Bever labels (B) and 
(D) in Bever (1970). 



(2.1) The first N. • .V. (N) .clause is the main clause, 

unless the verb is marked as subordinate* 

(2.2) Any Noun-Verb-Noun (NVN) sequence within a 

potential internal unit in the surface structure 
corresponds to " actor-action-ob^ject 

Thus, in an experiment conducted with children between 1 1/2 and 
2 1/2, it was found that when presented with a complex sentence like 
the cow tha t Jumped walked away. , and asked to act out, some children 
ITct out tTie doubly-xmderscored portion. It appears that they considered 
the first clause to be the main clause, in other words, that they 
followed (2.1) completely, and that they were unable to make use of 
overt marks of subordination. It is hard to tell whether they can 
recognize such marks at all, but whatever the situation, a strategy 
like (2.1) appears to override other considerations. Similarly, it 
was found in another experiment that the performance of children on 

reversible" passives (i.e.., sentences in which semantic or contextual 
information do not give any clue as to the roles of the actants) 
deteriorates considerably around the age of four. The suggested 
conclusion is that children begin to depend on a strategy like (2.2) 
at that age. 

Bever does not attempt to decide whether strategies like (2.1) 
and (2.2) are inductions over experience or whether they are determined 
by maturational development. He points out, however, that even an 
empiricist view of the acquisition of such strategies which would 
claim that they "are formed in response to natifred probabilities in 
the actual speech that the child experiences," would have to explain 
why the children select precisely these strategies out of the many 
alternative possible generalizations; in other words, it would have 
to "include a nativist component" (Bever, 1970). 

The claim that children acquire a number of perceptual strategies 
before they have a complete grammar is crucial for the" claim that 
grammar may be constrained by perception. Indeed, if certain 
perceptual strategies already exist at the stage at which grammatical 
generalizations are acquired, the child may constrain the form oof some 
of his grammatical rules by incorporating perceptual restrictions 
directly into his predictive system. In such a case, we can say that 
a behavioral phenomenon has been "grammatized." This is, however, 
not the only possibility, for the grammar the child acquires need not 
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be, and in all probability never is, affected by all his perceptual 
strategies. In .a large niamber of situations, perceptual strategies 
survive without affecting the grammar, and they lead to unaccept- 
abilities of various degrees, rather than to downright ungrammaticality. 

In addition to the "positive" strategies that children acquire, 
there are almost certainly constraints as to what kinds of strategies 
can be acquired. To the extent that such constraints reflect 
limitations of the processing device, they must be universal. However, 
as behavioral limitations determine gradual, rather than abrupt, 
complexity, they can lead to two kinds of situations, exactly like 
the positive strategies: they can either be grammatized, and thus 
become grammatical constraints, or they can survive as purely 
behavioral, limitations , in which'case the degree of unacceptabi.lity 
will be proportional to the "amount of complexity" present in the 
string. 

As Sever himself points out, it is not at all a simple 'matter 
to decide which cases of unacceptability are due to "purely syntactic" 
factors and which are due to behavioral ones. Indeed, the only data 
available to us are intuitive judgments as to the relative accept- 
ability of discourse, but intuition is not a very reliable tool for 
discriminating between ungrammatical and "strictly unacceptable" 
sentences, and is in general powerless to ferret out possible 
behavioral causes when some unacceptability has been grammatized. 
Therefore, it is to be expected that the causes of certain unaccept- 
abilities be undecidable, at least for the time being, and that even 
the most believable claims as to perception affecting grammar may not 
be provable beyond a certain degree, of doubt. I believe, however, 
that we can regard a case as having been convincingly presented when 
it can be argued that some set of apparently unrelated .phenomena 
(which coxild be accounted for formally only by invoking arbitrary 
and ad~hoc features) receive a simple and straightforward explanation 
in behavioral terms. 

With respect to the problem of distinguishing ungrammatical 
from merely unacceptable sentences, it would be unreasonable as I 
pointed out above, to require a fool-proof procedure. However some 
rules~of-thumb may prove useful. 

With respect to erroneous closure, whose effect is to "lead the 
hearer up the garden path" (so that the acoustic signal has to be 
reinterpreted if one is to make sense out of it), we may regard the 
difficulty as grammatized, if the sentence x*emains bad even after it 
has been pointed out to the hearer that he had been led up the garden 
path ; on the other hand, if the sentence becomes acceptable when the 
hearer has been told where closure should take place, we may assume 
that there has been no graimnatization. 

Concerning cases involving interrupted behavior, we may assume 
that grammatization has not occurred $ if we get a gradual acceptability 
s cale (rather than a g ood/bad dichotomy )^ as we vary the values of 
the pertinent parameters (length, complexity, predictability, etc., 
of the intervening materieLL). An apparent counterexample to this 
generalization would be the case of multiple center-embeddings , whers 
there seems to be an abrupt jump from tolerable acceptability to 
very high unacceptability when we move from one degree to two degrees 
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of embedding. However, center-embeddings are essentially different 
from the cases in which we get gradual judgments. Thus, the latter 
are found with one instance of interrupted behavior, while center- 
embeddings to a degree higher than one involve at least two independent 
instances of interrupted behavior which must be handled simultaneously; 
the situation is therefore radically different, and, in addition, there 
seems to be some perceptual conflict involved in multiple center- 
embeddings (Bever, 1970; for additional discussion, see notes lb and 
19 to this chapter). 

Regarding cases which exhibit perceptual conflict, V3 may assume 
that gramraatization has not occurred if sentences become acceptable 
after we have trained ourselves to overcome the difficulty involved 
and to puzzle out the meaning of the corresponding sentences; if the 
sentences are still bad when their meaning can be apprehended with 
comparative ease, we may regard this as an indication of grammatization. 
For example, if perceptual conflict is involved in John^s father *s 
brother or in it*s not the case that John^s not goinp? to the p«arty had 
no effect on Mary — as claimed in Bever (1970) (see also note 18 to 
this chapter) — I believe that the conflict is not grammatized; in 
particular, expressions of • the former type (and of the same or greater 
length) are probably used quite commonly in euithropological literature. 
On -the other hand, if perceptual conflict is involved in the badness 

he,' said that John^ vas sick — as I argue in Chapter Four — grammati- 
zation probably has^ occurred, for the feelin/r of deviance persists after 
it has been made cYear to the hearer that he and Jobn stand for the 
same referent. 

By the criterion I have just offered, center-embeddings like 
the boy the teacher the girl kissed spat at died would be regarded as 
ungraramatical rather than merely difficult, for it is hard to believe 
that people could become entirely comfortable with such sentences. Most 
writers have, however, argued that the grammatical restriction would be 
hard, if not impossible, to state, as it is not true that all center- 
embeddings are as bad as the previous one; for example, the surgeon 
who the girl !^ that hoodlxim raped vas afraid of had won high 
distinctions. irTmedical school is quite acceptable (the symbol 
indicates overlong pauses). Therefore, it is at least doubtful 
whether multiple center-embeddings should be regarded as ungrammatical 
in English. This indicates that the various criteria I have proposed 
do not always yield the desired results; this is, however, not 
surprising, since I have made it clear that they are rules-of-thumb, 
and nothing more. 

2.1. We shall now consider the three perceptual principles 
mentioned above in some detail; 

2.1.1. The strategy (2.1) mentioned in 2.0.1. is one factor 
capable of giving rise to erroneous closure. The point is discussed by 
Bever (1970 )* who acknowledges his indebtedness to Kirk. Thus, as 
can be seen from the paradigm in (2.3), when a subordinate clause 
appears initially, the mark of subordination is not deletable, even 
though it may be deleted in other positions. 
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(2*3) a. It is odd that John dislikes sweets. 
It is odd John dislikes sweets. 

c. That John dislikes sweets is odd. 

d. *John dislikes svreets is odd. 



Bever also points out that strategy (2.1) predicts that subject 
relative pronouns of restrictive relative clauses are not deletable, 
as the following tensed verb would be interpreted as the verb of the 
nain clause. However, if the tensed verb is be, ar.w -f the latter is 
in turn deleted, the result is grammatical, for either the reminder of 
the verb is not tensed and can only be subordinate, or there is no 
verbal form left which would enable (2.1) to interpret the string 
erroneously. These facts are exhibited in {2.k). 



{2.k) 



a. 
b. 
c. 

d. 

e. 
f. 



h. 



The^map who came late forgot his money. 
^ The^^'man came late forgot his money. 
The senator who was in the saddle shot from 
the hip. 

* The senator was in the saddle shot from the 
hip. 

The senator in the saddle shot from the hip. 
The monkey who was running after the bus slipped 

on a banana. 
* The monkey was running after the bus slipped 

on a banana. ' 
The monkey running after the bus slipped on a 

banana. 



The underlined sequence in (2.Ub, d and g) is interpreted by (2.1) as 
the main clause, and the remainder of the sentence in question cannot 
be made sense of. 

As far as I can tell, (2.1) is gramraatized in most (possibly in 
all) dialects o,f English with respect to the facts exhibited in (2.3) 
and (2.U); that is, there are no dialects that I know of in which 
(2.3d), (2.Ub), (2.Ud) or (2.Ug) are grammatical. There is, however, 
an interesting difference between (2.3) and (2.U): vhile a complementizer 
can be deleted in object position — as shown by the grammaticality of 
(2.2b), the deletion of a subject relative pronoun is usually 
prohibited in both subject and object position, even though this does 
not follow from (2.1); that is, many dialects do not allow I^m a man 
loves his beer cold , although some do (the sentence seems to be at 
least easier to comprehend than (2.Ub), (2.Ud), or (2.Ug) for anyone). 
As I pointed out in the Introduction, the underlying complexity/ 
acceptability correlation may be obscured through rule-generalization 
and other phenomena typical of language change, and I believe that the 
restriction on relative pronoun deletion in non-subject position in 
certain dialects of Modern English is a case in point. I also 
expressed the view that the lack of transparency of a posited under- 
lying complexity /acceptability correlation should not discourage 
efforts aimed at proving the existence of such a correlation; in the 
case under discussion, supporting evidence for the relevance of (2.1) 
to (2.3) and {2.h) has in fact been found. 
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Bever points out that many subjects Judge (2.5a) to be just as 
bad as the starred sentences in (2.U) on a first presentation . 

(2.5) a. The horse raced past the barn fell, 
b. The horse seen past the barn fell. 

liowever, when it is pointed out to them that (2.5a) should be inter- 
preted similarly to (2.5b), they judge (2.5a) as entirely acceptable. 
It appears therefore that (2.1) is grammatized only when the verb is 
u nambiguously tensed. The fact that subjects nevertheless tend to 
reject (2.5a) on the first presentation strongly suggests that a 
strategy like .(2.1) is operative, for otherwise we would expect (2.5a) 
to be rejected only about 50 per cent of the time, depending on whether, 
the hearer happened ^to interpret raced as a tensed or an untensed form.^ 



This is, unfortunately, not quite correct, for there are 
additional variables which viciate this conclusion. Thus, (a) the 
causative reading of race, is somewhat less common than the non- 
causative reading, and (b) the causative reading requires that the 
verb be passive, which, as has already been, shown experimentally, is 
a more complex form than the corresponding active one. Assiaming a 
strategy by which, given two options (and all other things being equal), 
subjects tend to select the less complex option (see also principle 
(2.22) below in the text), it is conceivable that this strategy may 
also determine the perception of race d (2.5a) as a tensed verb, over 
and above (2.1); consequently, the initial rejection of (2.5a) cannot 
be unambiguously attributed to (2.1). 

This shows the enormous difficulty of constructing meaningfiil 
experiments for syntactic perception, for the huge niimber of inter- 
acting cuss makes it extremely hard to control all the pertinent 
variables. ^ . 



Another experiment cited in Bever (1970) as supporting evidence 
for (2.1) concerns the two pairs of second degree center-embeddings 
in (U.6). 

(2.6) a. The editor the authors newspapers hired liked 
laughed. 

b . The editor authors the newspapers hired liked 
laughed . 

Subjects were asked to paraphrase sentences like the above, and 
experienced much greater difficulty with the b- than with the a- 
type* This can be again accounted for by (2.1), for, while both 
construction types involve two interruptions, as well as perceptual 
conflict, the b-sentence also invites erroneous closure (the underscored 
string being a possible sentence in (2.6b) but not in (2.6a)). 

In tin interesting recent paper, Chapin, Smith and Abrahamson 
(1972) suggest that Bever •s strategy is in fact a particular instance 
of a more general strateQ^ which they formulate as follows: 



(2.7) In imposing an initial structural description on 
a sentence, the S(ubject) attempts at each 
successive point to close off a constituent 
of the highest level possible. 

They argue that (2*7) can account for all the cases covered by (2.1), 
and also for the following experimental results which they report on: 
Subjects were presented with sentences like those in (2,6c) and (2.6d); 
in the former, the first major boundary is the main break. Clicks were 
located midway between the two boundaries, and subjects were asked 
to report the location of the clicks. 

(2.6) c. The Chairman of the Board should give 
whoever invented that process a prize, 
d. Women who are over sixty years old and kids 
must have the first chance to escape. 

In sentences like (2.6c), there was a significantly greater* 
tendency to prepose the click towards the earlier boundary than to 
postpose it towards the later one; this was interpreted as a tendency 
for clicks to migrate towards the main sentence break. On the other 
hand, in sentences like (2.6d), there was no greater tendency for the 
clicks to postpose than to prepose, although the main break was the 
second one; Chapin et al. propose that the factor which interferes with 
the attraction exerted "by the main sentence bresdc is precisely (2.7), 
which prompts subjects to assume that the underscored string in (2.6d) 
is an argument of the main clause, while it is in fact only a term of 
a coordinate argument pf the main clause ♦ 

Another instance of perceptual complexity explainable, by (2.7), 
but not by (2,1), is (2.6e). 

(2.6) e>??A rti3t8 girls, boys and women h urry a lot 
tend to resent it. 

This sentence was mentioned by Sever in a public lecture at the 
Linguistic Institute in 1971. This sentence is bad because the string 
ending with lot tends to be misconstrued as the main clause; however, 
(2.1) cannot te'll the whole story here, for the deletion of non- 
subject relative pronouns is perfectly possible in principle. 
Thus, artists pirls hurry a lot tend to resent it is perfectly 
acceptable , presumably because the lack of pause between artists and 
girls prevents a coordinate reading. The cue constituted by the 
absence of pause after a rtists in (2.6e) is apparently overridden by 
•another strategy, which Sever formulated roughly as "a sequence of the 
form IIP ^-CB?^ interpreted as a coordination." Observe, however, 

that '(2.7) can account for the fact that the underscored sequence in 
(2.6e) tends to be perceived as a coordinate NP, without requiring 
an additional strategy: this sequence can be misconstrued as subject 
of the main clause, and premature closure occurs after women , when in 
fact it should have occurred after lot . Thus, as the underscored 
sequence has been misconstrued as the subject of the main clause^ it 
plus the sequence hurry a lot can be misconstrued as the main clause 
(it appears that (2.7) applies twice in this sentence). 
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It shoiild be pointed out that (2.7) is inadequate as it stands: 
If subjects indeed try to close off a constituent of the highest 
level possible as soon as possible, then why should they close in 
(2.6e) after women and not after artists? After all, the latter is 
a perfectly possible subject of some main clausel Similarly, Chapin 
et al, argue that the most natural order of prenojninal adjectives 
in English (which, as pointed out in Vendler, I96I, requires that 
adjectives be arranged from left to right as they increase in 
"nominality") maximizes the efficiency of (2.7), since it allows the 
iiP to be closed as soon as an element of lesser nominality than the 
preceding one is encountered. Thus, the delightfia red plastic cup 
is more acceptable than ? the delightful plastic red cu p, because, 
according to Chapin et al., the latter invites erroneous closure after 
plastic , however, why does closure occur after cup in the former, 
rather than after plastic or -even red , for both the delir;htful red 
plastic or the delightful red a re theoretically possible NP's? It 
appears that there is something missing in (2.7), and that an sonended 
formulation something like (2.8) would be in order. 

(2 #8) In imposing an initial structural description 

on a sentence, the 5(ubject) attempts at each 
successive point to clos^ off a constituent 
of the highest level possible; however, closure 
is suspended until some significant cue is 
encountered. 

\^at can constitute a "significant cue" remains to be investigated 
for each construction type* In the case of the natural order of 
adjectives, that cue apparently is the occurrence of a constituent 
of lesser nominality than the preceding one, which acts as a NP- 
boundary marker; in coordinations, a helpful cue may be the presence 
of a coordinating conjunction, which signals that the last member of 
a coordination is about to come (or has already occurred, in languages 
like Latin); however, this cue is not available in languages which lack 
an overt morpheme and and conjoin by Juxtaposition. In any event, 
closure must be suspended until some cue carrying sufficient weight 
occurs, or coordinations should be unacceptable, optional sentence- 
final adverbial modifiers should create perceptual problems, and 
sentences like John screamed at me or Bill is eating an apple should 
receive low acceptability ratings becaus'J J ohn screamed and Bill is 
eating are possible English sentences. 

Before closing this section, it is appropriate to try to pinpoint 
some factors which may tend to favor the gramraatization of complexities 
resulting from erroneous closure (of course, complete and rigorous 
predictions are, as I have stjressed a number of times, strictly a goal 
for the remote future). Specifically, we may ask why (2.5a) "improves 
upon acquaintance," while (2.Ub) does not; that is, why the former 
becomes acceptable once it has been pointed out to the hearer that 
raced is a past participle (as laid down in the rule~of~thumb I gave 
at the end of section 2.0.1). My guess is that grammatization tends 
to be inhibited when there exists some overt feature of the string on 
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which the mistaken initial interpretation can be blamed. In (2.5a), 
the blame can be laid on the hearer's failure to select the correct 
homonym for the item raced. In the starred sentences of (2.1*), 
however, the hearer can only be blamed for his failure to reconstruct 
an item whose logical presence is not signalled by any overt cue; as 
the hearer has not misinterpreted any of the overt features of the 
string, there is no way in which he can devise a strategy for avoiding 
erroneous closure in the future (i.e., there is nothing to which he 
can try to pay better attention), and it is reasonable to assume that 
such sentences will tend to be avoided (and ultimately thrown out of 
the language). 

Selection of the wrong homonym is certainly not the only mistake 
which may be corrected by paying better attention to some overt cue 
in the acoustic signal. Thus it is to be expected that supra- 
segmental cues may also inhibit grammatization; for example, it is 
possible to distinguish intonationally between C John or Bill or Mary] 
and C John or C Bill or Moxyl l (essentially, by uttering the lower level 
coordination more quickly). This intonational^ option presumably 
enables (2.6e) to improve upon acquaintance, as it should be possible 
to make it clear by pitch, stress and/or use of pauses that only the 
last three of the first four NP*s form a coordination. On the other 
hand, it is hard to see what intonational cues could keep the two 
putative readings of the underscored strings in (2.1*) apart: since 
only nonrestrictive relative clauses allow the deletidn of the pronoun 
in English (compare the man I hit is sick with ^ the man, I hit, is 
sick ) there is no way to signal the intended relative pronoun by 
pause; on the other hand, short subjects cannot be separated by pause 
from their verbs either, so that pause cannot distinguish between the 
two readings here. Notice also that the relation the remainders of 
the imderscored strings bear to the initial NP is exactly the relation 
they bear to the deleted pronoun, since both elements are subjects; 
as these remainders would be VP^s of the same type on either reading, 
there is no way in which their suprasegmental structvire could force 
one reading rather than the other. Notice aliso that the two putative 
subjects can always allow the same VP^s (since they are coreferential 
and therefore necessarily enter into the same selectional restrictions), 
while erroneous closure in sentences like (2.6e) is only possible when 
the verb occupying the position of hurry can be construed as either 
transitive or intransitive (compare (2.6e) with the more acceptable 
artists girls, boys and women hkte the guts of usually develop 
suicidal tendencies! ^ I assume that the fact that there are unambi ruous 
sentences consti-ucted on the pattern of (2.6e) except for the lack of 
an intransitive reading for the counterpart of hurry may also 
inhibit the tendency to throw (2.6e) out of the language. 

I have discussed several linguistic illustrations of the operation 
of a principle of erroneous closure (tentatively formulated as (2.8)), 
and have also attempted to pinpoint some of the factors that may favor 
graramatization. Additional (mostly grammatized) instances of erroneous 
closure will be considered in^the ensuing sections and chapters, and 
I shall argue that their grammatization can be accounted for 
essentially by the same principle I have postulated here, namely, that 
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erroneous closure tends to result in \angraimnaticality when it cannot 
be avoided by concentrating on some overt feature of the string (which 
may have been overlooked on the first take). 

2.1.2. This section is devoted* to the so-called "cross-over 
phenomena" discussed in Postal (197I). Postal recognizes two broad 
subclasses of phenomena, namely, cross-over restrictions on bounded 
rules, and on unbounded rules. I shall argue that the former subclass 
is essentially explainable in terms of a conflict principle — (2.23) — , 
and that most of the phenomena in the latter can. be explained on the 
basis of an interruption principle — (2.37) — coupled with the already 
mentioned principle (2 .22; , which, in strategic terms , translates as 
the instruction "whenever possible, choose the simplest alternative." 

In his book. Postal presents the cross-over constraints as 
purely syntactic, on the grounds that the sentences they rule out 
"have unique interpretations and involve no semantic violations at 
all." Obviously, Postal did not consider the possibility that his 
phenomena might have a psychological explanation, but I shall argue 
that such an explanation can be both simpler and more illuminating. 

The bounded rules that Postal is concerned with are Passive, 
Subject-Raising, Tough-Movement, About -Movement and Flip, and these 
are illustrated in (2. 9) -(2.13) respectively. 

(2.9) a. John stabbed Mary . 

b . Mary was stabbed by John. 

(2.10) a. It seems to me^ that John is stupid, 
b. John seems to me to be stupid. 



(2.11) 



a.« 
b. 



It is difficult for Mary to shave Bill . 
Bill is difficult for Mary to shave. 



(2.12) a. I talked to John about Mary , 
b. I talked about Mary to John . 

(2.13) a* I_ am amazed at John , 
b. John amazes me. 



Postal points out that in his idiolect — and in a large number of 
dialects — the b-sentences in the above set become ungrammatical if 
the underscored HP's'are coreferential , as in. (2.lli)-(2.l8b) . 



(2.ll*) a. John stabbed himself. 
(2.15) 
(2.16) 

(2.17) a. I talked to Bill about himself . 

b. ''^I talked about Bill to himself. 



b . *John was stabbed by himself . 

a. It seems to me^ that am stupid. 

b. *I^ seem to myself to be stupid. 

a. It is difficult for me to shave myself . 

b. am difficult for myself to shave. 



t 
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(2.18) a. I m caazed at myself , 
b. 71_ amaze ayself . 

At first blush, we seem to be dealing with a purely syntactic 
restriction, which prevents bounded movement rules "^SomcaSng i^ NP 
to cross over another NP with which it is coreferential . Z^eveT 

FTrlt'ir"- ^T"-^' " ^o-^ these nnes. 

First the existence of some of the transformations claimed to 
be involved in the above set has been thrown into doub^ by v^ious 
writers. Thus, Jackendoff (1969), offered some evidence LI^s? 

S Fp^a'Sr?' T^.'^^f^ ^'T'^i points- out that he doe^'not claim 
that (2.18b) is derived from (2.38a), or that sentences related by 

If (2'i3ar^r?^ i^br°"'^rf • ^^^^^^ ^-^^^ p^^^^^^ ^^at 

^f a is claSL^?; S^-"' ""^^^""^ *° """^^^"^ transformationally, and 
vT f Claimed to be in some sense more basic than b because its ^«rfnL 
subject is its logical subject (an experiencer) , We Se othS pa^^s 
Ld^'tlirtf like bui: and sell, express Converse relations! 

T.til ii ^ T "° choosing one of them as more 

Flip-cases is not nearly as clearly established as for the 
other four cases; indeed, most people that I have consulted find (2 l8b) 

PostS Irs'f f " °^ sentences Ihi^ ^ 

" mill or.i'/'^- * .1^"" acceptable to n^vself or I am loathsome to • 
ini:sel£ still, my informants found other examples of Postal's less 

find'(f-175T^^ "^rf as pompous. \e this as it maJrpeople 
?,^rf /.w ^^^^^ (2.ll»)-(2.l6b), and if it should . 

nriirl J^'f ' "° Tough-Movement rule, the entire cross-over 
principle would be thrown into doubt. 

,„.J"f"*^' P°^"*^ o^*-' ^ interesting exception to the 

restrictions on About -Movement; the restrictions do not hold i"^ the 

^ro^nir f coreferential with the two NP's that 

cross over each other, as in (2.I9), 

(2.19) a. I talked to myself about myself. 

b. I talked about myself to myself. 

In order to handle this case. Postal invents the notion of pronominal 
nrgin I.e., an NP which has not yet been pronominalized £ ?he stage 
SJ^J ^^""^ transfomation applies, and requires that the cross-over 
r^nlt'^??' apply only to pronominal virgins. Then, the desired 
PosS^ usis i« ordered before about -movement . , 

Postal uses the notion, of pronominal virgin also in attiring to 

rffSver b^t di''f?"%°".""?°^'^' "movements, which do'notVf?ect ' 
reflexives but do affect simple pronouns; his solution is to order 

SoZinS,-^Sr" S^-movement. This proposal assuass that f 

and m^i^f ^" rloTTwhich has been seriously questioned lately), 

and makes a number of assumptions about rule ordering Jhich the author 

solution S:!/'"""''? "doubtful" (in Postal, 1970). Moreover, this A 

l^llZ as t^rdo?''""' constraints. .} 

MP • "^J^^^? that the constraints do not hold if the second t 
NP IS heavily stressed^ ar.i especially "if such particles as even a^e ] 
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added" (p. 9). Thus, compare the sentences in (2.20). 

(2.20) a. *John is hated by himself. 

b. John is hated by himself. 

c. John is even hated by himself. 

Postal points out that his fonnulation of Cross-Over does not account 
for the acceptability of (2.20b, c) and devotes a separate chapter 
(Chapter 19) to a discussion of this problem. Basically, he proposes 
that coreference can either be binding or predicational . and that the 
latter kind (illustrated by (2.20br^) is exempted from the constraints. 
Binding coreference arises when the speaker presupposes two NP's to 
be coreferents, but the sentence is about something else, while pre- 
dicational coreference arises when no identity is presupposed and when 
asserting coreferentiality between two NP's is the comment of the 
sentence. IVhile this account may well be correct, it is not obvious 
why Cross-Over should single out binding coreferentiality as its domain 
of application. 

We shall now consider a possible explanation for the unacceptability 
of the b-sentences in (2.llj)-(2.l8). Let us begin by noticing that the 
b-sentences m (2.9)-(2.13) are psychologically more complex than the 
corresponding a-sentences (or, to use Prague-school terminology, the 
a-sentences are "unmarked," while the b-sentences are "marked"). Indeed 
in the case of the passive, it is clear that it is more complex than 
tne active, since children acquire it much later and also, many adults 
tend to avoid passives, whenever possible). Bever suggests that the 
greater complexity of the passive is due to the violation of (2.2) 
(in fact, Beyer's "actor-action-object" should be replaced by something 
like logical subject-predicator-logical object," to cover cases in 
which the subject is inanimate, and therefore an instrument, or in which 
the verb is stative and the subject an experiencer) . I do not know what 
?o i^t i^""® °^ acquisition is for the members of the pairs (2.10)- 

1>fo o\ complexity of the passive is indeed due to a violation 

of (2.2), then a similar violation occurs in (2.13b). Indeed, only the 
HP-slot filled by me is necessarily animate, as can be seen in (2.21). 

(2.21) a. I am amazed at the scenery. 

b. «The scenery is amazed at John. 

c. Thi» scenery amazes me. 

d. *John amazes the scenery. 

With respect to the b-sentences in (2.I0) and (2.11), they are more 
complex than the a-sentences for they involve logical discontinuities 
m surface structure. Indeed, the predicate seem takes two arguments, 
an experiencer and a sentential object. The latter forms a surface 
unit m the a-sentence (t hat John is stupid) , but is broken down into 
two separate parts in the b-sentence (JqhT'and to be stupid ). Similarly, 
the predicate difficult also takes an experiencer and a sentential 
object (I assume, following Jackendoff, I969, and Chomsky ,' 1971 , that 
an earlier stage in the derivation of (2.11a) is it is difficult for 
Mar/i for Mary,- to shave Bill). Like the sentences in (2.10), (2.11a) 
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Dxhibits a continuous sentential object, (for Mary) to shave Bill ^ 
while (2.11b) exhibits a discontinuous one. Bill and (for Mary) to 
shave. Finally, we can argue that (2.12) -is imnarked because the 
linear order of its three KP's expresses the natural hierarchy cf their 
roles, since John is an experiencer, and therefore necessarily animate, 
while Mary is an object and can be replaced by inanimate or abstract 
nouns, as veil as by nominalizations ; on the assximption that animate 
roles are higher in the hierarchy than inanimate ones, the failure of 
the linear order of (2.12b) to reflect this fact makes it more marked 
than (2.12a) > 



It would be tempting to generalize from these facts and claim 
that the unmarked order of arguments always reflects the hierarchy of 
their roles. However, there are obvious difficulties with such a 
proposal. For example, the unmarked ordering of Instrumental and 
Objective differs according to construction, as in (i): 

(i) The man broke the window with a hammer* 
(ii) *The man broke with a hammer the window., 
(iii) .The manner broke the window* 
(i^')^The window broke the hammer* 

As we can see, (ii) is ill-formed, and (iv), even though it is probably 
well-formed grammaticcdly and odd for non-linguistic reasons, is not a 
paraphrase of (iii). 

Furthermore, it is not even clear whether animate cases (i.e., 
Agent and Kxperiencer) necessarily precede all other cases in unmarked 
order. Thus, it is not clear that (v) is less complex than (vi). 

(v) I gave Jolin a book, 
(vi) I gave a book to John. 

Perhaps, then, the feeling that (2.12a) is less complex than (2.13b) 
is merely due to the former's greater ffequeacy of occurrence in English* 
In that case, the phenomenon would be no more remarkable than that 
place adverbs usually precede 'me fidverbs in Ktiglish, while the opposite 
situation obtains in other la.,Qr\ages, like French or Rumanian. Be 
this as it nay, the explanation of cross-over phenomena that I wish to 
propose depends only on the fact of markedness and not on its cause . 
Notice, incidentally, that the more general strate<^ (2.2) must also be 
lang-.age-specif ic, as one would not expect it to develop in a language 
with unmarked surface order VOS [apparently, Malagasy is such a 
la:3Guage, according to E. Keenan (public lecture, 1971 Linguistic 
Institute)]. 



Wow that the facts of markedness are established for the pairti in 
(2.b)-(2.13) , it is in.portant to notice that if the members of the 
various pairs are cof^nitively synonymous , they differ with respect to 
focus . 5 



-^The term 'focus* is often used in the literature as a synonym 
of 'comment/ and is thus contrasted with 'topic' However, I use 
'focus' in this section to mean 'that which is more prominent 
subjectively.' Thus, in (i) below, the focus coincides with the 
topic (John); however, in (ii), the focus coincides with the comment 
( a snake , the stress being contrastive) . 

(i) John saw a snake, 
(ii) John saw a snake.. 

This use of the term 'focus' is also found in Partee (1971), who 
notes that 'adding CcontrastiveD stress Can instance of comment] 
and raising to subject Can instance of topic3 are two ways of 
accomplishing the same restilt, namely the bringin^^ into focus ' 
(underscore mine). 

As I have defined it, focus is a relative notion, and ther^e may 
be more than one focus per sentence. In fact, the explanation of 
the grammaticality of (2.19b) which I propose below rests precisely 
on the assumption that sentences may be plurifoceG.. 



Change of focus can be achieved in a variety of ^.'ays (heavy 
stress, clefting, unbounded topicalization rules, topic-introducers 
like a s for , with respect to , etc.), and in the cases "at issue, it 
can be achieved through the use of a marked form, which causes an 
earlier appeeirance of some argument ^d.th respect to the unmarked 
form, thereby throwing this argument into focus and defocusing the 
eirgument which constitutes a focus in the corresponding unmarked 
form. Thus, Passive and Flip focus on the logical object and defocus 
the logical subject, Subject-Raising and Tough-Movement focus an 
element of the sententieJ. object and defocus the experiencer, and About- 
Movement focuses the about - phrase and defocuses the exjieriencer. Clearly 
however, as focus is created by displacing arguments within a sentence , 
it is only the eetrliest appesimace of some NP in that sentence which 
can have focusing effect (the pertinence of this fact will become 
obvious below, when (2.19b) is discussed). 

I shall now introduce two fairly obvious principles based on 
experience, which, I believe, can explain not only the problems 
connected with the bounded rules, but also with the unbounded ones. 

(2.22) Unless there is an indication to the contrary, 

one tends to aissume that people do not 
complicate matters unnecessarily. CThis 
principle is a reformulation of a conversa- 
tional implicature proposed in Grice (1968)3. 

(2.23) When something is done for a ptirpose, and that 

purpose cannot be achieved in principle, 
perceptual conflict arises. 
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{2.23) can provide a simple explanation for the unacceptability 
of the b-sentences in (2.li|}--(2.l8) . Indeed, a hearer naturally 
assumes that a speaker uses the more complex marked form i n order to 
focus on some liP and defocus another; however, there turns out to be 
no such possibility in these cases, since the focused and defocused 
NP's refer to the same entity. In other words, the use of the marked 
form is pointless. 

To put the matter another way, the sentences in question are 
odd because they create conflicting. assumptions in the hearer; the 
fact that the marked form is used leads the hearer to believe that 
the focused and defocused entities are different; on the other hand, 
the existence of a reflexive pronoun tells the hearer that the speaker 
presupposes that the two entities are not different. The reason why 
these sentences oxe odd but interp etable, is that the assumption of 
coreferentiality alone is a logical presupposition, while the assumption 
of non-coreferentiality is merely an implicature. If both assmptions 
were logical presuppositions , the sentences would be semantically ill- 
formed, like my brother, Mary, is sick . 

Let us take a quick look at the various problems that I earlier 
claimed arise if the cross-over phenomena are treated in purely formal 
terms . 

First, the cross-over principle, as formulated by Postal, depends 
crucially- on there existing a movement transformation . Nothing of the 
kind is required for a principle like (2.23) to be applicable. Indeed, 
all we need is the existence of pairs of cognitively synon3anous 
sentences, which differ in perceptual complexity and in focus 
potentiality; the synonymy need not, however, be statable trans- 
formationally. I believe that the sentences related by the putative 
Tough-movement rule are synonymous (at least in the vast majority of 
cases, since the counterexamples advanced in Jackendoff (19^9) are 
sufficiently rarfe for us to be able to assume that some learners may 
never hear them). Non-synonymy, on the other hand, seems to be the 
rule rather than the exception for 'flipped' sentences, and this is 
probably v;hy the constraints are very much weaker or totally non- 
existent for most speakers. 

Second, in order to account for the acceptability of (2.19b), 
Postal had to invent the notion of pronominal virgins and stipulate 
that the cross-over restrictions apply to pronominal virgins only; 
this in turn called for a number of assmptions.. about rule-ordering. 
Notice, however, that (2.19b) is not niled out--or marked as odd — by 
either (2.23) or (2.22). Indeed, due to the fact that there is an 
earlier NP — the subject — which is coreferential with both reflexive 
pronouns, the about- phrase fronting can no longer have a focusing 
effect, and therefore there is no assumption that it took place 'for 
a purpose,' as required for the applicability of (2.23). The choice 
of the marked form is still "an 'unnecessary complication' (see (2.22)), 
rjid one might assume that the two NP's which switched places are not 
coreferential, if it were not for their reflexive form, which is 'an 
indication to the contrary.' Thus, (2.19b) is possible, as a 
stylistic variant of (2.19a). 

Third, Postal attempts to explain the failure of heavily 
stressed reflexives to obey the constraints by claiming that the HP's 
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involved are not presupposed to be coreferentisil at the stage at 
which their coreferentiality is asserted. Specifically, he derives 
(2.2l*a) from (2.2Ub) by a rather complex transformation which he 
calls Contrast-Movement. 

(2.2U) a. Bill cut himself. 

b. The onej Cwhoj Bill^ cut] was Bill^ (himself). 

I believe that if (2.2Ua) and (2.2Ub) are related transformationally, 
it is the former, rather than the latter, which is basic, and this 
claim will be defended in considerable detail in 2.1.3. The direction- 
ality of the transfonnation relating . (2. 2Ua) and (2.2Ub) is however 
not crucial here, for in either case Bill and himself are not pre- 
supposed to be coreferential. One possible explanation of the accept- 
ability of John was shot by himself is that the focus of a sentence is 
in most cases its subject, therefore a change of subject by one of 
the bounded rules at issue effects a change of focus; however, if there 
is a stressed constituent in the sentence other than the subject, that 
one counts as the focus. Since the subject is no longer the focus, 
it can no longer be assumed that the change of subject was made in 
order to create a new focus; raiiher, it is assumed that the marked 
form was selectad for some other reason(s). The reason which determines 
such selection in a case like John was shot by himself is that this 
sentence presupposes John^ was shot by someone j (usually due to a 
previous assertion, such as John was shot by Bill ); thus, John wasn^t 
shot by Bill, he shot himself sounds much less natural than John wasn*t 
shot by Bill, he was shot by himself . Notice that this line of approach 
enables us to explain why *J6hn was shot by himself is bad; indeed, this 
-entence presupposes ^ someone-; was shot by himself . 

We are also in a position to understand why {2.20c) is better than 
(2.20b), a fact which Postal^s Contrast Movement rule cannot accotmt 
for. Indeed, the scope of even constitutes a focus, and (2.20c) has 
two overt signals that the subject is not the focus, namely, even 
and contrastive stress, while (2.20b) has only one such signal, 
contrastive stress. If the crossover phenomena are due to behavioral 
reasons, as I suggested, then iT; is quite natural to expect that two 
signeLLs- should be better than one in this situation, for they make the 
shift in focus easier to perceive. 

In stating Cross-Over, Postal restricts its applicability by 
using the notions of mention, peer, clause-mate (we have already 
mentioned pronominal virgin). V/hile these notions do indeed yield the 
desired resu. ts, it remains to be shown whether there is a deeper 
explanation why precisely these notions should have to be used in 
precisely the ways in which Postal uses them. In other v;ords, it is 
legitimate to ask whether these notions ?vre in. some way connected with 
the fact that there are cross-over constraints, or whether they 
constitute a set of arbitrary, unrelated consitions. I believe that 
the notions in question are natural consequences of the account I have 
.proposed. 

The requirement that only HP^s mentioned in the structural 
description of a transformation be subject to cross-over restrictions 
(originally made in Ross, 1967), is a totally superfluous requirement 
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in the framework I am proposing. Consider for example (2.25b), 
which is acceptable" — according to Postal — because himi is not 
mentioned in the structural description of Subject-Raising, which 
moves it over Johnj . Observe, however, that (2.23) does not mark 
(2.25b) as odd, because the focus of the unmarked form is John, 
while the focus of the marked form is the woman who loves him , and 
the attempt" to create a new focus does not fail. 

(2.25) a. It seems to John^^ that the womani^who loves 

himj^ is sick. 
• b. The woman who loves himi seems to John^ 
to be sick. 

Similar arguments will be seen to apply in the case of the unbounded 
rules . 

The notion of peer is meant t^o account for the, acceptability 
of (2.25b), where the two coreferents are not peers. 

(2.26) a. Johni^s wife struck him^. 

b. John^ was struck by his^ wife. 

This specification is again unnecessary in my account,, for the 
unmarked form focus is John^s wife , while the marked form focus is 
John, and (2*23) is inapplicable. 

Finally, Postal's Clause-Mate Restriction must apply to NP's 
which are clause-mates before one of them crosses over the other, in 
order that the crossing movement may be blocked. This causes serious 
difficulties in relation to Subject -Raising (or It-Replacement ^ as 
Rosenbaum called it ) , because the coreferents are not clause-mates 
in the input to this rule. Postal proposes to obviate this difficulty 
b3'' breaking It-Replacement into two steps, one which raises the 
complement subject to object position in the matrix sentence and 
one which flips the new object to subject position. Thus, rather 
than derive (2.15b) from (2.15a), he proposes that the source of 
these sentences is to me seems that I am stupid , which yields to me 
s eems I to be stupi d, and ultimately *I seem to myself to be stupid . 
This account fails at the level of observational adequacy,, since 
speakers do not, in general, have the same intuitions about the 
severity of violations arising through Subject-Raising and through 
clear cases of the putative Flip rule. 

In my account, if a clause-mate requirement were needed, it 
would be needed only in the output of the various putative trans- 
formations, a condition fulfilled in all of (2.li|)-(2.l8) . This is 
so because the applicability of (2.23) can only be judged with respect 
to the marked form. For (2.23) to reject a marked form, the new focus 
must of course be a clause mate of the unmarked form focus, since 
focus by bounded rules is precisely created by causing an NP to occur 
as an earlier argument of some verb . 

We shall now consider the restrictions on unbounded rules, 
which I shall ^^^ry to show are explainable, to a large extent, by 
(if. 22). Let U3 briefly consider the facts and Postal's solution. 



- 68 - 



Postal concentrates on three unbounded rules: V/h-Q-Movement , 
\NTi-Rel-Movement , and Y-Movement. I shalL^also consider Clefting, 
which is interesting to compare with Y-Movement, on account of formal 
similarities. The data to be explained are given in the paradigm below. 

(2.27) a. Maryi thinks that Bill loves her^. 

b. vmoi thinks that Bill loves her^? 

c. *Whoi does Mary^^ think that Bill loves? 

(2.28) a. The girli Cthe girli knows Bill loves the 

girlj^D is pretty, 
fc. The girli ^^Oi knows Bill loves her^ is pretty, 
c. *The girli ^^o^ she^ knows Bill loves is pretty. 

(2.29) a. Mary emphasized Johni must think that Jill 

loves him^. 

b. Johni, Mary emphasized must think that Jill 

loves himi. 

c. *Himi, Mary emphasized Johni must think that 

Jill loves. 

(2.30) a. John talked to Mary about herself. 

b. Mary, John talked to about herself. 

c. Herself, John talked to Mary about. 

(2.31) a. John^ thinks that Mary loves him.. 

b. It's Johni "^^^ thinks that Mary loves himi. 

c. It's himi *^ohn^ thinks Mary loves. 

(2.32) a. John talked to Mary about herself. 

b. It's Mary John talked to about herself. 

c. It's herself John talked to Mary about. 

I 

(2.33) a. *CV7hosei mother] does John^^ admire? 

b. *The mani [whose. motherD hei hates is a freak. 

c. CHisi motherD, John^ admires. 

One similarity between the restrictions on bounded and unbounded 
rules is that neither seems to be universal. This is probably a 
consequence of the fact that both (2.22) and (2.23) are perceptual 
principles , but not lof^ical necessities . 

Another similarity between the two kinds of rules is that 
restrictions apply only to NP's mentioned in the index of those rules. 
Thus, Postal points out V/h-Q-Movement is free to operate in (2.3^), 
because it is the \rti-word, and not him, which is mentioned in the index 
of these transformations. 

(2.3^) a. Charleyi visited some of the men who criticized 
himi. 

b. CWhich of the men who criticized himjD did 
Charleyi visit. 
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In contrast to bounded riiles, unbounded rules obey the constraints 
even when the crossing does not involve clause-mates, either in the 
input or in the output, as can be seen in (2.27)-(2.29) . 

Another difference is that crossing restrictions are not always 
restricted to peers. As can be seen in (2.33), the constraints hold 
even for non-peers in the case of T-m-Q-Movement and Vh-Rel-Movement , 
although not in the case of Y-Movement. In other words, even though 
the moved NP^ is not a coreferent of any other NPj that it crosses 
over, unacceptability may arise if the moved UP^ contains an NPj^ 
ooreferential with NPj . However, why should unacceptability arise in 
(2.33a) and (2.33b) but not in (2.33c)? Postal invokes Ross' notion 
of Pied-Piping, which says that if a wh-word is part of a larger NP 
(and if some additioneUL conditions are satisfied — for which, see 
Chapter One, (1.22) — ) the larger^NP may be moved along with the wh- 
word; if, however, the wh-word is on a left branch, the larger NP must 
accompany the wh-word. As (2.33a) and (2.33b) have the wh-word on a 
left branch. Postal proposes to regard the wh-word as the mentioned HP, 
and to ascribe the fact that it did not move alone to the LBC. On the 
other handi Postal claims that Pied-Piping is not applicable to Y- 
Mpvement, and thus (2.33c) is grammatical because his^ cannot be the 
mentioned NP. However, Postal does not ask why Pied-Piping should be 
inapplicable to Y-Movement. Notice that Pied-Piping does not depend 
on a wh-word; rather, it seems to require that the mentioned NP shoixld 
be identifiable by some overt surface mark, thus suggesting that 
perception is somehow involved. This hypothesis is supported by the 
fact that Pied-Piping is_ applicable to Clefting, which differs from 
y--Movement only in allowing the identification of the clefted element 
by contrastive stress. Thus, it is reasonable to regard h is mother as 
the mentioned NP in (2.33d), but his as the mentioned- one in (2.33e). 

(2.33) d. It's Chis^ mother] John^ admires most. 

e. It's Chis^ mother] John^ admires most. 

A third difference between the two types of rules is that none of 
the starred sentences in which cross-over violations by unbounded 
rules occur is unacceptable on all its readings . Thus, for every 
starred sentence in (2.27)-(2.33) there is a reading on which the crossed- 
over NP's bear different indices, i.e., they are not presupposed to 
be coreferents (which does not mean 'that they are presupposed to be 
non-coreferential, for a perfectly natural answer to (2.27c) is Mary 
herself ; also, the following is semantically well-formed: I know who 
Meiry thinks Bill loves, it^s Mary herself , etc.). Moreover, when the 
moved NP is a reflexive as in (2.30) the coreferential interpretation 
is both semantically possible and acceptable (or grammatical); Postal 
accounts for this fact by pointing out that reflexivization precedes 
unbounded movements, and that the moved reflexive is not a pronominal 
virgin. 

Finally, it should be pointed out that there are no oross-over 
constraints on Clefting—as seen in (2.3l)-(2.32)~at least for the 
dialect in question. 

In proposing an -alternative td Postal's account, we nhall have to 
answer at least the following questions: 
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(2* 35) a. \7hy do V/h-Q--Moveinent , \7h--Rel-Moveinent , and 
Y-Movement obey the constraints, but not 
Clefting? 

b. \Ihy do unbounded movements not result in 

unacceptabilities, and why are reflexives 
exempt from the constraints? 

c. Why are the constraints confined to "mentioned" 

NP's? 

d. V/hy is Pied-Piping inapplicable to Y-Movement? 

With respect to 4*^f^) , we shall first inquire whether the 
constraints can be explained by either (2.22) or (2.23). A quick 
comparison of the paradigm in (2.27)-(2.33) with the one in (2.ll*)-(2.l8) 
will show us that the choice is between the input and the output of 
some tranrfonnation in the latter, but between two outputs in the former. 
Out of the four unbounded rules considered, only Wh-Rel-Movement and 
Y-Movement qualify for either (2.22) or (2.23), since it is only for 
these rules 'that the two possible outputs (i.e. .> the one in which the 
closest possible candidate has moved and the one in which the more remote 
has done so) are cognitively synonymous. Indeed, there is every reason 
to assume that both outputs are derived from the same underlying 
representation: the Rel-Wh- feature can be placed on any coreferent of 
the head, without semantic effect, and ' Y-Movement , which fronts the 
topic of the sentence, achieves the same effect by moving any of several 
coreferents. On the other hand, when different coreferents are 
questioned or clefted, the results are not cognitively synonymous, as 
the various outputs have different underlying representations and 
different presuppositions. Indeed, the source of (2.27b) is someone^- 
t hinks that Bill loves her-j^ while the source of (2.27c) is ^ Mary^ ^ 
thinks' that Bill loves someone j. With respect to clefted NP^s, they 
are semantically equivalent to contrastively stressed ones, so that 
(2.31b) and (2.31c) are related to the semantically distinct Joh n 
thinks that Mary loves him and John thinkg that Mary loves him , 
respectively. Gome confusion may arise between clefted and topicalized 
sentences, because informants may involuntarily stress NP*s .fronted by 
Y-Movement, which makes them indeed equivalent to clefted ones. However, 
if one is carefxil not to put constrastive stress on WP*s topicalized 
by Y-Movement, the different behavior of the two rules becomes clear. 
The paradigm in (2.36) shows that clefted reflexives behave like 
stressed ones and topicalized reflexives behave like unstressed ones 
with respect to the constraints. 

(2.36) a. *John was cut by himself. 

b. ^Himself , John was cut by. 

c. John was cut by himself. 

d. It^s himself John was cut by. 

As Clefting does not obey the constraints, there is nothing to 
explain, and the inapplicability of (2.22) or (2.23) follows from the 
fact that the various outputs are not cof^nitively synonymous. The 
problem will be to show why the constraints hold for questions, since 



the various outputs are not derived from the same deep structure, or 
from synonymous deep structures. Thus, questions will have to be 
explained by a different principle than relatives, which I do not 
regard as a weakness of my analysis, since there is no logical necessity 
that questions and relatives receive the same explanation, etlthough 
Postal assumes such a necessity without comment or justification. 

We have established that when several outputs are possible for 
V/li-Rel -Movement and Y-Movement, these outputs are expected to be 
synonymous, as they -are- derived from the same underlying representation. 
I do not see how the fact that the more remote coreferent- cannot be 
relativized or topicalized can be accounted for by (2.23), since I do 
not see what could qualify as a ^purpose* . In the case of the bounded 
rules, the purpose was interpreted as the creation of a new focus, and 
this is inapplicable here. I will therefore assume that (2.23) is 
inapplicable, which already provides a partial answer to (2.35b), since 
(2.23), but not (2.22), can mark a sentence as unacceptable.- To see 
whether (2.22) is applicable in these cases, we must first establish 
that the choice of a more remote coreferent vould * complicate matters* " 
in some way. I believe that such a choice would indeed constitute a 
complication, especially in view of the fact that the linear distance 
and the number of intervening sentenfcte nodes between the two candidate 
NP^s could be arbitrarily large, since the rules involved are unbounded 
ones. Indeed, the choice of the more remote constituent woixld unneces- 
sarily strain the hearer *s memory, by forcing him to await the ^resolution* 
of the fronted constituent — i.e., its ^plugging* into the constituent 
to which it immediately belongs, thereby revealing one of its roles^ — 



TFronted constituents play at least two roles: they serve as com- 
plementizers or topic-introducers in their surface position, and as 
arguments of the predicate of some lower clause in thfeir underlying 
position. 



for a longer period of time. I do not know whether the greater 
complexity involved in the choice of a more remote coreferent is 
reflected in the process of acquisition; that is, I do not know wuether 
children first learn to relativize or topicalize out of the first 
sentence down, then out of the second, and so on. Such complexity has, 
however, experimentally been shown to exist for the adult due to the 
following interruption principle (Sever, 1970); 

(2.37) Discontinuous components are perceptually complex 
in proportion to the structural complexity of 
the intervening material. 

Notice that in the output of neither rule is there an indication 
that the two coreferents must be coreferential, as personal pronouns 
need not be coreferential with anything but the relative clause head. 
Therefore, the explanation of the unacceptability of (2.27c) and (2.29c) 
is quite simple : "given the fact that discontinuities increase in 
complexity with the complexity of intervening material, there is a 
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reasonable assumption that discontinuities will be as 'smcdl* as 
possible; as there is no • indication , to the contrary*, i^e., no overt 
mark of necessary coreferentieLLity , a hearer assumes that the 
discontinuities in the two above-mentioned sentences are minimal, 
and that there is no coreferent of the fronted between its surface 
and underlying positions. The situation' is different with respect to 
(2.30c), as the fronted reflexive does constitute an overt mark of 
necessary coreferentiality . The hearer knows that a fronted constituent 
plays two roles (as pointed out in footnote He also knows that 

a reflexive must have a clause-mate coreferent. As no such clause- 
mate can exist in its 'surface position, the conclusion is unavoidable 
that it must exist in its xinder lying position. As the hearer also 
knows that the clause-mate must be to the left of the reflexive, a 
topicalized reflexive already tells him that the latter must have 
'crossed* a coreferent. Therefore, the coreferential interpretation 
is possible (in fact, necessary) because (2.22) is inapplicable, and 
this interpretation is well-fonned because (2.23) is inapplicable. 
Notice that the explanation we have proposed has also provided an 
answer to (2.35,b). 

Principle (2.22) can explain an interesting dialectal peculiarity. 
Thus, the paradigm in (2.38) holds for some speakers: 

(2.38) Mary told John^ that she loves him^. 

b. *John^, Mary told that she loves him^. 

c. ^Himi , Mary told John^ that she loves. 

d. John, Mary told that she loves.. 

Neither of the two coreferents can m«ve alone for these speakers, 
under conditions that I have not investigate.d^arii;, detail. For our 
purposes, it is sufficient to notice that chopping must occur *across- 
the-board* when both coreferents occupy object position. Ro.ss (1967) 
pointed out that the across-the-board condition constitutes an 
absolute exception on unbounded chopping rules (in the sense of Lakoff , 
1970a) in coordinations. That is, it is not only the case that chopping 
must occur in all the terms of a coordination If each contains a 
coreferent, but also each coordinate t erm must contain a coreferent 
in specific positions if chopping is to occur at all. We can .see that 
the across-the-board condition overrides the strategy which requires 
that unbounded movements be as economical as l^ossible. 

V/hile the across-the-board condition in coordinations is a very 
strong, and, possibly universal, one, there appears to be a weaker 
condition with respect to other structures; moreover, the strength 
of this condition varies dialectally. Thus, for the dialect in 
question, across-the-board chopping is mandatory in structures like 
(2.38a), but only if the matrix and the complement contain coreferents. 
Thus, we are hot dealing here with an absolute exception, for John, 
Mary told that she loves Jack is perfectly well-formed; in contrast, 
* John, Mary loves and Jean hates Bill is not. Moreover, (2.38b) is 
not ill-formed for the speakers in question on all its readings, but 
only with the linkages indicated. It is interesting that if the link- 
ages indicated are impossible in (2.38b) in the dialect at issue, there 
are many speakers who accept (2.38b), but find it odd and prefer (2.38d) 
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instead. Ross (1967) mentions a number of adverbieJ. constructions 
which differ from both cases mentioned above in that chopping is 
precluded out of the adverbial, but possible out of the matrix or 
out of both; however, if the environment for across-the-board c 
chopping exists, chopping out of the matrix 'alone is sometimes odd; 
more specifically, the acceptability of chopping out of the matrix 
alone when the adverbial contains a coreferent seems to be inversely 
proportional to the acceptability of across-the-board chopping in the 
same cases. 

We shall return to across-the«-board chopping later, and will try 
to provide an explanation for the fact that across-the-board chopping 
is sometimes an absolute exception and sometimes not, as well as for 
the fact that it is possible in matrix and adverbial constructions in 
which the adverbial alone cannot lose an element through chopping 
vinder any circumstances. For the time being, we shall only notice 
that the reasons for the vinacceptability (or marginality) of (2.38b) 
are fairly straightforward: while (2.38d) creates an additional 
discontinuity, as compared with (2.38b), it also makes it perfectly 
vinambiguous that the object of the matrix and that of the complement 
are coreferents; on the other hand, (2.38b) leaves the referent of. 
him ambiguous, for although the linkages indicated are a possibility 
in some dialects, they are nowhere a necessity. Thus, (2.38b) appears 
odd, because the hearer may assume that the speaker will not clLIow his 
discourse to be more ambiguous than necessary; in fact, we seem to 
be dealing with a clash of two strategies: one that assumes that 
discontinuities are not increased without reason, and one that assumes 
that ambiguities are not proliferated without reason (both are 
instantiations of (2.22)). The dialectal variation in the acceptability 
of (2.38b) reflects the reasonable assumption that neither strategy 
is overwhelmingly stronger than the other; the way in which the 
conflict at issue is resolved determines the particular degree of 
acceptability of such sentences in individual dialects. 

The across-the-board condition was illustrated with Y-Movement 
only in (2.38), but it holds for all vuibovinded movements. For example, 
it holds for Clefting (which is in general8 free from cross-over 



"I have found that the speakers who accept (2.31b) reject (2.31c), 
instead of which they say It^s himself^- John-; thinks Mary loves . This 
fact is again explainable by (2.22) , since, so long as the reflexive 
is possible in this situation, it is clearly a less ambiguous, and 
^therefore simpler, choice than the personal pronovin; thus, the use 
of a personal pronovin creates presumption of noncoreferentiality in 
this dialect. 



constraints), as shown in (2.39) (as in the case of Y-Movement, the 
acceptability of (2.39b) is subject to dialectal variation). 

(2.39) . a. It*s John Mary told that she loves. 

b. *It*s Johni Mary told that she loves him^. 
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The reason is that (2.39a) has the same meaning in the dialects in 
question as (2.3Sb) purports to have (or that it has, in the 
dialects in which it is acceptable). As a choice between two 
synonymous forms exists, (2.22) forces the choice of (2.39a). 

I shall now attempt to provide an explanation for the behavior 
of questions. i 

Postal (1971) notes in Chapter 21 that Jackendoff has proposed ^ 
at the 2nd Annual La Jolla Conference on English Synteix, that vh- 
words are wh+some words, and that the ill-formedness of (2.U0a) is 
due to the ill-formedness of its source, (2.U0b). 

(2.U0) a. nuho^ did she^ claim Jack kissed? 

b. *Shei claimed Jack kissed, someone^ . 

Postal rejects Jackendoff *s proposal, but since he later admits in 
Posted (1970a) 9 that he had done so for ^largely wrong reasons,^ I 



Postal (1971) first appeared in 1968. 



shall not try to show that Postal ♦s initial criticism was mistaken. 
It will be interesting, however, to look at what Postal regards (in 
the more recent paper) as ^right reasons^. Postal gives four arguments 

(2.Ul) a. In order to make the 8U*gument go through, it 
is necessary to show that restrictive and 
appositive relative clauses obey the 
constraints for the same reasons as questions, 
b. It is not true, as Jackendoff claims, that some- 
words can never refer backwards to a personal 
pronoun, as (2.U2a) shows. Moreover, there 
are a number of cases "in which the wh-word is 
not fronted, e.g., (2.U2b),and in those cases 
the wh-word can refer to a preceding pronoun. 

(2.U2) a. Although he^ thought he^^ shoulO run, I 

explained to some man^ (l know) that the FBI 
would catch him^^ in no time, 
b. The nevsman who criticized him^ later belted 
which official^? 

(2.Ul) c. The constraints hold not only for wh-words, but 
also for a certain class of nominals which 
contain them. But although a sentence like 
(2.U3a), is ill-formed, its source, (2.U3b), 
is well-formed. 

Ci 

(2.U3) a. *Whose^ father^ ♦s brotherj^ did his •< j 
sister criticize? (^k^ 
^i 




0. His ^ j > sister criticized the brother, of 
the father j of someone^. 
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(2.Ul) d. In question clauses with more than one vh-word, 
the wh-word which does not move can refer 
backwards to a pronoun. However, if that 
wh-word is fronted, it can no longer refer 
forwards to that pronoun, as shown in 

(2.UU) a. V/liat company had his^ wife spy on what well- 
known industrialist^^? 
b. *Vhat well-known industrisdist^ did the company 
have hisi wife spy on? 

With respect to (2.Ula), I think it is sinply vronR. Indeed, if 
the same constraints obtain in two separate instances, it might be 
desirable, but certainly- not necessary , to show that the same cause 
determines the restrictions on both cases. 

With respect to (2.Ulb), I think that Jackendoff^s claim is too 
strong, due to his failure to distinguish between the properties of 
stressed and unstressed some . Although I have not conducted an 
exhaustive investigation of some, I wish to propose the following 
tentative generalization, based on Postal ^s examples: 

(2.U5) a. unstressed some cannot refer backwards to a 
pronoun. 

b, stressed some can refer backwards to a pronoun, 
subject to the general constraints on back- 
wards pronominalization (i.e., some cannot 
be both commanded and preceded by the pronoun). 
By way of illustration, consider^ (2.U6) and 
(2.U7). 

(2.U6) a. That he^^ was sick disturbed John^. 

b. That he^ was sick disturbed some boy^^. 

c. *That hSj^ was sick disturbed some boy^^. 

(2.J*7) a. That his^ mother was dying disturbed John^. 

b. That his^ mother was dying disturbed someone^. 

c. *That his^ mother was dying disturbed someonej^. 

If v^e assume that question wh-words come from the some which 
ultimately receives stress, rather than from the one which does not, 
and if Jackendoff •s proposal is modified to incorporate this assumption. 
Postal ^s argument loses its strength. Indeed, consider Postal •s 
example (36), which I reproduce below as (2.U8a) and (2.U8b) and 
which purports to show that questions can be well-formed when the 
underlying structure that Jackendoff assigns to them is ill-formed; 
however, if the underlying structure is taken to (2.U0c), rather 
than (2.U8a), we see that the question and its nource af:ree in 
grammaticality. 



{2.UO) a. *The tyrant tortured her^ mother in front of 
some helpless princess j^. 

b. WJiat tyrant tortured her^ mother in front of 

what helpless princessj^? 

c. The tyrant tortured her^ mother in front of 

some helpless princess j^. 
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There are independent reasons for believing that interrogative 
jrti-words coiae from the some which ultimately gets stressed. First, 
interronative wh-words also receive stress; this fact, taken in 
isolation, cannot be ascribed too much significance, for relative 
wh-words never receive stress, but their sources have in general been 
regarded as full-fledged iiP's, which can be stressed. However, the 
stress correlation, together with the second fact I shall mention, 
does carry weight. Thus, stressed some has been described as 'specific 
whole unstressed some has been described as 'non-specific!. 
Specificity is a complicated notion, much more so than it had initially 
been thought, and must be defined with respect to (often proper) 
subparts of sentences (Dean-Fodor, oral communication). Thus, in 
(2.U9a), someone is specific with respect to John, but not with 
x-espect to the speaker of thir sentence . 



Sim:ilarly, in {2.h9^)y vho is specific with respect to John rather 
than with respect to the speaker of the suntence. The hypothesis 
that wh-words are specific can explain the fact that ■(2.l49c) is non 
tautologous, for (a.U^d) is not tautologous either, but {2.k9e) is. 



Making the notion 'specificity* precise is a task far beyond the 
scope of this discussion, but if it can be argued that interrogative 
w_h_-words are always specific wiih respect to at '.ast some clause, this 
will be sufficient motivation for regarding such words as derived 
from specific some, which will account in a straightforward fashion 
for (2.1*8). ' 

With respect to {2.klc), I can report that I have presented 
various native informants with fi more complete version of (2.I43), 
which I render below as (2.50), and that none accepted (or had heard 
speakers vho accepted) any of the sentences in (2.50) with the J or 
k linkages. 



(2.1*9) a. John thought that someone was in the buildi 
b. John wondered who was in the building. 



(2.1*9) c. Vifhere is John? 

d. John is somewhere. 

e. John is somewhere. 




Tlie brother, of the father, of whom,- did 




hisi y sister criticize" 
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■ His,/" sister criticized the brotheri, of 



the father i of someone,. 



1 



Until more data can be gathered which corroborate Postal's, I 
think we can re^iard his data as suspicious and far-fetched. But even 
if ve accept them, Postal^s case is not yet established. Indeed, 
Postal discusses only (2.50a) and (2.50d), which he assumes are 
transformationally related, a not at all obvious point (and not a 
universally accepted one). But even concedini?; this point, we cannot 
be sure that crossing over a cojreferent is responsible for the unaccept- 
ability of (2.50a) with the ^ and k linkages, so long as Postal does 
not tell us what his Judgments are"about (2.50bK If he rejects it 
(which is suggested by his failure to discuss it), the xxnacceptability 
of (2.50a) might be^due not to cross-over, but to the preposing of 
genitives. 

With .respect to (2.1ild), 1 have also been unable to find anyone 
willing to agree with In fact, every informant felt that the 

b-sentence is better than the a-one, although both were Judged 
acceptable. Postal himself points out that the unaccept ability of 
(2.Hb) is dialect-restricted, arid I would guess that the dialect in 
question constitutes a definite minority. In any event, the existence 
of speakers who accept (2.1*3) or {2M) does not constitute a difficulty 
for my proposal, for it is perfectly possible that certain language 
learners may reinterpret certain grammatical facts, when the crucial 
counter evidence is far.rly rare and may not be encountered at the 
pertinent moment. .^Thus, rather then extracting the generalization 
that questions are ill-formed whev their soiarces are ill-formed^ a 
learner may infer that questions are ill-formed when there are 
coreferents of the question-word between its surface and deep positions. 
ouch» reinterpretation of data is perfectly possible when the facts do 
not unambiguously force some unique interpretation for example, 
Carden cites the case of a family in which every membei spoke a 
different 'dialect* with respect to quantifiers and negation. There- 
fore, although certain speakers may have reinterpreted the cross-over 
restrictions on questions as purely formal ones, it is still the case 
that these restrictions can be given a natural explanation for a 
large number of speakers. 

There remains to answer questions (2.35c) and (2.35d) aiid th^ 
cnswer to bo^^h is quite simple. Concerning (2.35c), the constraints 
are confined to mentioned NP's for Wh-Rel -Movement , because it is only 
in this case that ,there e>:ists a choice between two synonymous trans- 
formational outputs. Thus, if a crossing KP displaced by V/h-Rel- 
Movement were not the one mentioned, i.e., were not a wh-word, the 
head of the relative clause would not be coreferential with it, and 
its crossed coreferent could not have been relativized. With respect 
to questions, where the constraints are due to general constraints on 
pronominalizatiqn involving stressed seme, non-mentioned HP's are 
exeupt, so long as they do net violate pronominalization constraints; 
for example, him is not prevented from referring to Charley either in . 
(2,3Ub) or in its source, (2*3Ua). Finally, non-mentioned N?»s are 
exempt in the case of Y-Movement, because the mentioned UP is different 
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fro.*n the corefereut of the uninentioned one, and the sentence is 
different in focus from the one that would result if zne crossed 
coreferent were topicalized. Thus, the pertinent output is not ruled 
out by {2.22) y since the complication has not been 'unnecessary', and 
it is not ruled out by (2.23) » _since the purpose for which a more 
remote IIP was chopped is a valid one. 

As far as (2.22) is concerned, fronting is the only means of 
identifying the mentioned UP in Y~Movement, but not so for questions 
and relatives, where the mentioned K? is . identified by the wh- feature. 
Therefore, even if Pied-Piping applies, it is still clear which 
element is questioned or relativized; but if Pied-Pip ig were applic- 
able to Y--Movement, it would be impossible to know what is topicalized. 
If it can be aurgued that chopping; a larger NP is perceptually more 
difficult than chopping a smaller one, the situation can be explained 
by (2.^2), as the hearer will always assiome that the smallest possible 
NP has been moved, unless he is given a clear counterindication in the 
form of the wh~fe.ature. 

As I have pointed out already, any overt feature identifying 
the mentioned NP should do, whether it is wh (as in questions and 
relatives) or h eav?/- stress (as in Clefting) (see (2.33)). 

2.1.3. This section is concerned with the complexity of i nterrupted 
behavior (brought up in the preceding section in connection • ith the 
cross-over restrictions on Relativization and Topicalization) , and 
especially with two structural discontinuities that must be handled 
concomitantly, which, as argued in 2.0.1 , ,create a comparatively hi/^h 
degree of complexity. The center of the discussion is the pseudo- 
cleft construction in English, a topic of considerable interest in 
its own right. Numerous analyses have been proposed for pseudo- 
clefts; none of them is entirely satisfactory, but I shall argue that 
an aneilysis based on the 'extraction' of the pseudo-cleft phrase comes 
closer to being adequate than analyses of other types so far proposed. 
The analysis 1 defend is weakened to some extent by the use of some 
ad hoc formal apparatus, but I argue that this dravrback does not 
invalidate th: essential claim of the section, namely, that the 
pseudo-cleft phrase is a member of a structural discontinuity , and 
that pseudo-cleft senterices' become perceptually complex when that 
discontinuity is combined. .with another one, such that the two are 
improperly nested. I also argue that the ad hoc apparatus I use is 
a consequence of the presently available descriptive models which do 
not allow a viable alternative to movement transformations in stating 
the existence of discontinuous components. Ultimately, the purpose 
of this section is to show that the low acceptability of concomitant 
double discontinuities is* (at least in part) responsible for the 
phenomena •hich led to the postulation of Ross' Right Roof Constraint 
(see (2.ii8) in section 1.2.h). Movements to the right may create 
several types of perceptual difficulty, and Ross, on the basis of a 
number of cases exhibiting Krammatization concluded that the 
HRC is a purely formal constraint on grammars. Basing himself on 
Ross' claim. Bach (1971) attempted to provide a principled explanation 
for the observation that COV languages do not seem to have a question- 
movement rule. Bach made the general assumption that syntactic 
categories and major transformations are identifiable across 
languages, as well as the following four specific assumptions about 
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question-mover.ent rules :* 



(2.51) a. The quesoion phrase is based on an indefinite 

pro -form; 

b. The question movement rule is unboxinded; 

c. The movement is to the front of the sentence 

or toward a f^overning verb (in fact, in the 
performative analysis, which Bach accepts, 
the movement is always toward a governing 
verb); 

d. The RRC. 

Assumptions (2.51b)-(2.51d) above can account for the absence of 
question-movements in a large number of derivations in SOV languages. 
Indeed, if the movement must proceed in one swoop, which is consistent 
with (2. 51-), rather than by cyclic 'hopping', and if the movement is 
toward a governing verb which is necessarily to the right of the phrase 
to be moved, it follows that there will be a large ntimber of derivations 
in which the question phrase will have to move to the right and outside 
the dominance of the sentence node which most immediately dominates it. 
Since such novements are precluded by the RRC, (2.51) successfully 
accounts for the lack of question phrase movements in such cases. 

Hcvever, as illustrated in (2.52) with a Japanese example, 
question^ovements never occur in SQV languages, not even when the 
question phrase, belongs to a clause immediately embedded to the ■ 
governing verb and when question mo^^ements would not violate the RRC. 

(2.52) a. v/atakusi-wa dare-ga pan o tabeta no ka to kiita. 

I who bread ate asked 

*I asked who ate the bread.' 
b. *Watakusi-wa pan o tabeta no ka dare-ga to kiita. 



To account for the ungranmaticality of (2.i;2b), Bach makes the 
following additional assumption: 

{2.'j3) Assumptions like (a) to (d) Cin (2.51)3 are 

assumptions constitutive of possible trans- 
formational components, rather than possible 
derivations. 



Let us call the position expressed in (2.53) the Strong Trans- 
formational Position , and the position upheld in Chomsky (1962. 1971) 
and Ross (1967) the V/eak-Transformational Position . Tho latter claims 
that a transformation is blocked only when its application violates some 
constraint, the former claims that if a transformation violates a 
constraint in some derivations, then it can never apply in any 
derivation. It is clear that (2.53) can account for the lack of a 
question-movement rule in SOV languages (or for the fact that such a 
rule, if it exists, is- always blocked), but it is al.50 clear that 
the Strong Transformational Position cannot be correct in general, for 
practically every transformation I can think of is subject to some 
constraint, and the position in question would in fact prevent all 
transformations from ever applying. 



. 80 . 

Bach is not explicit on which constraints he "ants to be subject 
to (£i.!>3), b\it we can be reasonably sure that ne does not regard all 
transfornations as strongly constrained' in the sense of (2.53); he 
was undoubtedly aware that other constraints of Ross', such as the 
Cii'PC or Crx\ sonetimes block Relativization or 7,uestion-Moveinent , but 
do not, for xhat matter, push- these rules out of the {^ramnar. ^>inco 
Bach does not elaborate on this natter, I will interpret his phrase 
'assunptions like (a) to (d)' in (2.53) as 'the assump-^- *ons (a) to 
(d)'; that is, I shall assume that he only claimed the RRC to be a 
strong transfornatioral constraint. 

There is clearly something unsatisfactory about postulatinf^ 
stron/; and weak transformational constraints vithout some principle 
able to explain. why certain constraints are strong rather than weak* 
I will attempt to s^how in this section that the HRC is neither a strong: 
nv-r a weak transformational constraint, but essentially a behavioral 
one. The fact that there are apparent strong transformational 
constraintr need mean no more than that grammatized perceptual 
limitations have been generalized. 

For my claim concerning the RRC to be empirically verifiable, it 
is necessary to find cases in which the pertinent restrictions are 
not grammatized (and, preferably, some plausible justification for 
r;rannatisation in the cases in which it has occurred). Specifically, 
V- must find at least one instance in which elements are allowed to 
move indefinitely far to the right, but in which the result varies in 
acceptability according to the values of some (set of) para!neter(s) . 
I i>ish to claim that there are at least two cases in point in fJnrlish: 
(a) the rule which moves pseudo-cleft phrases (following linmonds 
(iy69a), I shall call it F ocus Placement ) and whose discussion forms 
the bxilk of this section, and (b) the ru3.e of Coordination-Reduction 
from left to right, which is briefly touched upon in note 19 (for an 
extensive defense of this rule, see Hankamer, 1971). As I have pointed 
out already, the extraction analysis involves an undesirable formal 
f eat ure namely , the postulation of empty nodes in underlying repre- 
sentation; such empty nodes are no great embarrassment for tho Extended 
Standard Theory (Ercmonds, for example, uses empty nodes quite freely, 
and, T shall argue, too freely), but they have no place in Generative 
Semantics which claims that only well-fomed or meaningful semantic 
materia has a legitimate place in underlying representation. I will 
attempt to show, however (in this section and in Appendix One), that 
my analysis, while postulating semantically irrelevant material, does 
not postulate ill -formed material in underlying representation; in 
addition, I will suggest a way of severely limiting the use of empty 
nodes. On the other hand, the analysis proposed by such generative 
senanticists as Hoss or iiach does involve an i ncoherent underlying 
representation, :uid must therefore be judged inferior to mine. In 
fact, I do not know of any analysis so far formulated whicli enable both 
a semantically justifiable underlying representation and an observa- 
tionally« adequate statement of the pertinent syntactic generalizations; 
therefore, until a more adequate analysis becomes possible, I propose 
the provisional adoption of the account which follows. 

Consider ^he pseudo-cleft sentence in (2.5^). Let us call (2.5Ub) 
the non-pseudo-cleft counterpart of (2.5^a), and the underscored 



p 
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constituent, in either (2.5^a) or (2*5^b), the focus, 

(2.5^) a. What John ate was an apple > 
b. Joiui ate an apple ♦ 

I make the following assumptions about the pseudo--cleft construction: 

(2o5) a* The source of a pseudo--cleft sentence includes 
its non-pseudo-clefted counterpart with 
relative clause status. Specifically, I 



propose that a renote representation^ of 
(2.5^a) is roughly (2.56). ^ 



(2-5b) 




Tiie thing. wh John ate an apple was 

Presupposition ; John ate somethingj^ . 

The presupposition is the non-pseudo-cleft counterpart, except 
that the focus is replaced by a pro-form. The pro-form in the pre- 
supposition is the antecedent of the head of the relative clause in 
the underlying representation. This explains why the pro-form is 
indefinite and the head of the relative clause is definite 



•^^The problems involved in the semantic interpretation of pseudo- 
clefts are discussed in Appendix One. 



I 

ERIC 



(2.55) b. (2.5Ua) is derived from (2.5t») by 
(i) placing on the focus; 
(li) copying the focus into the empty Predicate 
by a rule of F ocus-Placement , such that 
" a pro-form is left behind; 

(iii; applying the rrle of Helativization to 
attract the resulting wh pro-form to 
the head of the relative clause; 
(iv) making optional morphophonemic adjustments. 
The result of applying Focus-Placement is 

unacceptable 
(i) if the instantiation of the essential 
variable-^-^ which intervenes between 



the end of the sentence which most 
inunediately contains the focus and the 
empty predicate is n on-n\xll , and 
(ii) in proportion to the size and the nature 
of that variable* 



■''■''See Postal (1971: Ch. 13) for a discussion of this term. 



First, I shall attempt to Justify (2.55a) and (2»55b), All 
fjrammarians who have concerned tnemselves with pseudo-cleft constructions 
have recognized that the source of the pseudo-cleft must somehow 
incorporate its non-pseudo--cleft counterpart. This is so for at least 
two reasons: 

(a) The selectional restrictions between the focus and its verb 
in the non-pseudo-cleft counterpart are the same as the selectional 
restrictions between the corresponding verb and the pseudo-cleft focus, 
as illustrated in (2.57); 

(2.57) a. John broke a glass. 

b. V/hat John broke was a pjlass. 

c. *Jomi broke a book. 

d. *V7hat John broke was a book. 

(b) Certain elements appearing in position of neutralization, 
such as ever , any, and reflexive pronouns, are grammatical only if 
commanded by the appropriate neutraJ.izing elements as shown in (2.58); 
however, this requirement need not be satisfied in pseudo-cleft 
constructions, provided that it is' satisfied in the non-pseudo-cleft 
counterparts, as shown in (2.59); the neutralizing elements are under- 
scored in both (2.58) and (2.59). 

(2.58) a. I do not beJieve that Mary ever harmed anyone. 

b. *I told the man who did not finish his dinner 

that Mary ever harmed anyone. 

c. John admires himself. 

d. *The girl who loves John also adnires himself. 

(2.59) a. V/hat I can not believe is that Mary ever harmed 

anyone. , 

b. I c anno t_ believe that Mary ever harmed [anyone . 

c. vv^hat the mis sile did was destroy itself. 

d. The missile destroyed itself. 



The fact chat the command-requirement with respect to neutralization 
is suspended in pseudo-cleft constructions, provided* that it is satisfied 
in the non-pseudo-cleft counterpart, is quite f^cneral in English. 
(<^.60)-(2.63) illustrate the point with Emphatic Reflexivization, Vi[h- 
Placeraent, and [Jequence of Tenses. 1 
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(2.60) a. Jill believes that the letter was written by 

. ^ r herself '\ 
^" Vhimself / • 
b. What Jill believes is that theJLetter was 
witten by Ann and /"-lerselfl 

(_*hiraselfj * 

(2.61) a. John l^^g^.^^^^J-v/here to go. 

b. Vrtiat John l^told ne^^ was where to go. 

(2.62) a. John {^^^^Jved}" ^^^^'i^' 

b. What John flj'^fl'^^®^ T is that he is stunid. 
(_*believed J 

A representation like (2.56) is, hovever, not the onlv loricallv 
possible way of incorporating the non-pseudo-cleft counterrart into 
the underlying representation of pseudo-cleft constructions. (2.56), 
which is essentially the representation proposed in Choraskv (1970) and 
Akmajian (1970), 12 has been challenged by Peters and Bach "(I96O) on 



12 

Actually, (2.56) differs in two respects fror; Chomsky's and 
Akmajian' s proposals, i-'irrt, it places the presuj osition in under- 
lying structure, while Chomsky and Akna.jian regard it as a feature of 
surface structure. Second, and more important, Chomsky proposes that 
the head of the relative clause is a dummy it, while Akmajian proposes 
that it is sometimes a genuine UP aftd sometimes a dummy, as in (i) 
and (ii) respectively: 

(i) The object thAt I saw was the house, 
(ii) V/hat I saw was the house. 

My om view is that all free relatives such as (ii) have genuine 
heads at some remote level of representation. Akmajian 's reason for 
postulating free relatives with dummy heads ir, that some free 
relatives do not have well-formed surface counterparts! vith genuine 
heads in his dialect; thus, (iii) is acceptable to him while '(iv) is not. 

(iii) Where John went was to Boston-r^ 
(iv) The place where John went was to Boston. 

J 

But surely this is no more than a surface phenomenon, because 
sentences like (iv) are acceptable to a great many speakers Jn. _ 
addition, headless relatives create much more serious problems, since 
the whole process of Relativization is based on there existing inside 
the relative clause a coreferent of the head- but how could a dummy be 
coreferential with anything? Notice also that overt forms of the " 
dummy rt, such as those associated with 'meterological' verbs 
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or resulting fron Extraposition, cannot be modified by relative 
clauses, as shown in (v) and (vi): 

(v) *It, which I dislike, is raining, 
(vi) *It, which nobody yet knows, is certain, that John 
has defected. 

For a convincing argumentation that free relatives introduced by 
}^iere, when, and while have meaningful heads in remote representation 
see M. Geis {191 6) . 



two grounds: 

(a) There is no natural way of carrying out step (2.55b.i) in 
such a way as to ensure that wh will be attached to only one NP. 

. (b) Since the pseudo-clefted constituent cannot be selected in 
the base, it must be selected by an attachment transformation; however, 
the pert-nent attachment transformation cannot be formulated, since 
the only WP's which can be pseudo-clefted are those which can be 
replaced by something with preservation of grammatically/, and' the UP 
to be pseudo-clelted and the verb with which it participates in 
selectional rescrictions may be arbitrarily far apart at the stage 
at which wh-attachment would have to apply. 

Peters and Bach (1966) conclude that any analysis which involves 
the 'extraction* of the focus from a non-pseudo-cleft construction in 
inadequate, and propose an alternative analysis, originally su/3:gested 
by Ross in personal communication: 

(2.63) -The -source of a pseudo-cleft sentence contains the 

non-pseudo-cleft counterpart in predicate position 
in construction with the copula, while the 
subject consists of "the NP t he thinf: modified 
by the non-pseudo-cleft counterpart in which 
s omething has been substituted for the focus. 

Specifically, the source of i2.5ha) would be (2.61*), rather than (2*56). 

(2.6k) s 




The thing John ate something was John ate an apple 
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In this analysis, the rule of Focus-Placement is replaced by a rule 
which deletes the, whole of the predicate except for the focus. 

It is rather easy to subvert Peters and Baches two objections 
to the extraction analysis and to show that their o\m proposal is 
inadequate on much more serious grounds. 

Regarding their first objection-, the difficulty they mention 
arises only because they propose to identify the focus by \^-placement , 
and because they propose to. allow any NP to occur, with wh-morphemes . 
This was understandable, in view of Chomsky's original proposal (see 
Footnote 12) which posited dummy-headed relative clauses for pseudo- 
cleft constructions. However, if we posit a meaningful head for the 
relative clause, the difficulty disappears, for the correct placement 
of \jh^ reduces to a problem which must ^be solved for relative clr^ses 
in general. ■'•^ one way in which this has been handled in the lite ature 



1 am grateful to A. Zwicky for pointing this out to me. 



has been to place wh_ on a coreferent of the head inside the relative 
clause-. (For ways of dealing with relative clauses which contain more 
than one coreferent of the head, see Postal (1971)). In the case of 
.structures like (^.56), if wh-placement precedes Focus-Placement (which 
is the order giv^n in (2.t)tJby because it is the one Peters and Bach 
criticize) , 'wh^ can be placed on the constituent of the relative 
clause which has no identical, counterpart in the presupposition « Ho* jver 
since the focus can be identified merely by comparison with the pre- 
supposition, Focus-Placement may precede Uh-Placement , and the latter 
can apply in perfectly straightforward fashion to the output of the"*? 
former, i.e., to (A1.8) of Appendix One. 

V/ith regard to" their second objection, it is simply incorrect 
that only HP's pronominalizable with something can be pseudo-clefted. 
Thus, a great many speakers accept not only T2.5^a), but also the 
sentences in (2.65), which Peters and Bach reject. 

(2.65) a. \^ere I met John was in l^aris. 

b. Vhen 1 saw Harry was in the afternoon. 

c. V/hv John left was to catch a plane. 

d. ?V/ho Nixon chose was Agnew. 

Clearly, however, ^the non-pseudo-cleft counterparts of (2.65a)-(2*65b) 
do not. tolerate the replacement of the focus vzith something , therefore 
the Peters-Bach proposal fails to generate a large number of pseudo- 
j:left sentences in a large number of diajects-.^ More important still, 
the Peters-Bach proposal fails to account for pseudo-cleft r>entences 
like (2.66),!^ which are undoubtedly acceptable in all dialects. 



That sentences like (2.66) are instances of the pseudo-cleft 
construction follows from the fact that they exhibit the two properties 
typical of pseudo-cleft sentences which were mentioned at the beginning 
of this section. Indeed, 
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(a) The selectional facts- exactly parallel those found in 
ordinary pseudo-cleft sentences, as can be seen by comparing (i) 
and (2,57) respectively. 

(i) a. John broke- a glass. 

b. The object which John broke was a ^l^-ss. 

c. *John broke a book. 

d. *The object which .John broke was a book. 

(b) The predicate can contain any- or ever-words when the 
latter are not .coiomanded by the neutralizing elements in surface 
structure, although this is impossible in general, as shown- in (ii) . 

[iij a. The insinuation which nobody could believe was 
that Mary had ever been involved with any 
shady characters, 
b. *The insinuation vMch nobody could believe 

suggested that. Mary had ever been involved 
with any shady characters. 

In addition, sentences like (2.66) are subject to two constraints 
typical of pseudo-clefted sentences which are discussed below in the 
text, namely, the CNPC and the RRC", as can be seen by comparinf; 
and (2.T^) with (iii) and (iv) below respectively: 

a. I suspect the idea which tortures Mil is that 
Mary loves John. 

b. *It's Bill who I suspect the idea which tortures 
is that Mary loves John. 

c. *It's John who I suspect the idea which tortures. 
Bill is that Mary loves. 

?*The drink I told you Jouv had at 3 o'clock at U' 
o'clock is whiskey. 



including theirs. 

(2.66) a. The girl who just left is Mary. 

b. The one who Mary admires most is herself. ^ 

Therefore, the Peters-Bach proposal is inadequate on observational 
grounds. The way to prevent the generation of the sentences^ in (2.65) 
in the Peters-Bach dialect is clearly not by limiting pseudo-rclefted 
constituents to positions which- may be occupied by s omething , but 
rather by blocking some of the morphophonemic rules reouired by (2.55b. iv) 
under the appropriate circumstances. Thus, while rules like the place 
a t which =^ where, the time at which :^ w hen, the manner in which how, 
etc. (which are independently needed for adverbial clauses in non- 
pseudo-cleft constructions, as shown in Geis (1970)), may be allowed to 
operate freely in most dialects, only the rule t he thing which :^what 
would be allowed such freedom in the Peters-Bach .dialect, while the 
remainder would have to be blocked in pseudo-cleft constructions 
(preiiumably hy restricting them to non-subject position). 



(iii). 



(iv) 
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Thus, neither of Peters and Bach's objections to an extracting 
analysis really go through; On the other hand, the alternative 
like [2. ok) for {2.^ka), must be rejected for the following two 
reasons (a third reason is provided at the end of this section): 

(a) The most obvious defect of {2.6k) is that it is semanticaDly 
incoherent. Indeed, the meaning of {2.3ka) is that the object which 
John IS supposed to have eaten was an apple; it most certainly is 
not that the object in question was the preposition John ate 'an app le. 



In personal communication, Ross has attempted to avoid the 
charge of meaninglessness by proposing that the subject of pseudo- 
cleft constructions is not a free relative, but an interrogative 
clause. Thus, the underlying representation of {2.5ka) would no 
longer be {2.6k) ^ but something Tike (i) J 

(i) The answer to the question 'Vhat did John eat?' 
is '^John ate an apple.' 

^^ile (i) is m-ahingful, it cannot be correct for two reasons: 

(a) it does rot allow the generation of pseudo^-cleft sentences 
with non-reduced heads in surface structure, such as (2.66). 

(b) there is overwhelming syntactic evidence that the subject 
clauses of pseudo-cleft sentences are free relatives and not interro- 
gatives. A large number of tests for discriminating between the two 
types of constructions was proposed in C. Baker (1968), and they all 
suggest that pseudo-cleft constructions involve free relatives. This 
is illustrated below with a few tests: 

(1) Relative pronouns, unlike interrogative ones, cannot be 
stressed: » 

(ii) *What John ate was an apple. 

(2) Subject interrogative clauses require singular verb agreement, 
while pseudo-cleft subject clauses do not: 

(iii) a. V/ho walked into the room / } clear. 

(^*werej 

b. What copulated {^JJ^^ } the dog and the bitch. 

(3) Relative pronouns, unlike interrogative onos, cannot be 
modified by ej^se: 

(iv) a. miat else you saw in the room is the Question, 
b. *What else you saw in the room is the apnle. 

{k) Interrogative clauses can contain more than one wh-wcrd, 
while relative clauses cannot: 



(v) a. What happened to whom isn't clear. 

b. *;fliat I saw when was Bill in the morning. 
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More recently, in Ross (1971), the Bach-Peters proposal is 
defended on the grounds that (vi) is a possible sentence in certain 
dialects. 

(vi) What I ate is I ate an apple. . 

However, the fact that (vi) is a possible surface structure does not 
make it a coherent semantic representation. All the existence of 
(vi) means is that the whole circled S-node nay be 'drapf^ed along' by 
Focus-P3.acement in, the pertinent -dialects . If (2.6U) is the underlying 
representation of (P.5^a), its semantic interpretation can only be 
given by an interpretative rule which applies to the output of deletion 
in the standard dialect and to surface structure in the dialects 
which tolerate {vi) * It is rather strange that Bach and Ross, who 
have consistently supported Generative Semantics in their work, seem 
to advocate an interpretivist approach in Just this one instance. 



(b) Neither elements of the subject nor elements -of the predicate 
can be moved out of pseudo-cleft constructions, as shown in (2.67). 

(2.67) a. I suspect what Mary told you Bill wants is to 

kill his mother. 

b. * It's you who I suspect what Mary told Bill 

wants is to kill his mother. 

c. *It's his mother who I suspect what Mary told 

you Bill wants is to kill. 

Given an analysis of pseudo-cleft constructions as in (2.6U), (2.67b) 
can be excluded in a principled way by invoking the Complex Noiin Phrase 
Constraint (see discussion in 1.2.6). However, given a deletion 
analysis, or Postal's analysis described in Appendix One, there is no 
general principle which could exclude (2.67c), for elements can in 
general migrate quite freely from predicate position, and an ad hoc , 
condition would have to be specified for blocking movements out oJL 
pseudo-clefted constituents.'' To see thiVmore clearly, contrast/ 
(2.68a), which is ambiguous between a pseudo-cleft and a non-pseudo- 
cleft reading, and (2.68b), where the non-pseudo-cleft reading alone 
is possible: 

(2.68) a. The discovery John announced was a procf of 

Mary's defection, 
b. It's Mary's defection which the discovery 
' John announced was a proof of. 

On the other hand, given an analysis of pseudo-cleft constructions ^ 
as in (2.t>6), both (2.67b) and (2.67c) can be excluded by the CHPC,"^^ 



I have found that there are speakers who can question (but not 
relativize) the whole focus , even though elements of the focus can 
be neither <"uestioned nor relativized. That is, the paradigm in 
i\) holds for those speakers. 
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(i) a. \;hat do you think that what John ate was? 

b. *The apple which you think that what John ate 
was had been bouf:ht in Canada. 

If we accept Akznajian's (1970) claim that cleft sentences are 
derived from pseudo--cleft ones, the distinction between questioning 
and relativizing the focus can be seen more clearly in (ii). 

(ii) a. What is it that John ate? 

b. ^The apple which it is that John ate had been 
bought in Canada. 

The point I wish to make is that the acceptability of (iia) and/or 
(ia) is no counterexample to my claim that the focus originates inside 
a complex NP. The questioned constituent is the triangled NP in 
(2.56), which does not -originate insiae a complex NP, and is therefore 
free to move. 

The reason (ib) and (iib) are bad is that foci are necessarily 
stres'sed, while relative pronouns (unlike interrogative ones) are 
necessarily unstressed. 

In fact, I believe that a deeper explanation is possible here: 
a cleft or pseudo-cleft constituent is the comment of the sentence 
(i.e., the new information thett it imparts), as opposed to the remainder 
of the cleft or poeudo-cleft construction, which constitutes the topic 
(or old information, since it is presupposed); as new infoniiation 
usually receives stress, the fact that the cleft or pseudo-cleft 
phrase constitutes new information is (at least in part) responsible 
for the fact that it is stressed. Questions are similar to cleft or 
pseudor-cleft sentences (this fact has been noticed before); thus, what 
John saw was a dofy presupposes John saw something , while in the case 
of -Whai did John see? the speaker believes John saw something to be 
true. Question wh-words are indicators of where the comment should 
be supptied in the •expected answer* (i.e., in a declarative sentence 
--■-identical with the question in all respects, except that the wh-word is 

r^olaceli by a more informative phrase); therefore, it is not 
^i;i^*prisipg that cleft or pseudo-cleft constituents shotild be able to 
'^Jo^/^iiWxioned, since the expectelS answer necessarily has the comment 
in bhe position of the vh-word. 

On the other hand, a relative vh-pronoun is the topic of the 
relative clause, while the remainder of the clause is its comment 
(this is probably why relative pronouns are always unstressed). 
Therefore, constructions like (ib) or (iib) imply that the relative 
pronoun is both a topic and a comment, that is, both old and new 
information, an obvious impossibility. Notice that we are deal/ng 
here not with perceptual conflict, but with semantic ill-formedness , 
for it is possible to infer a contradiction from (ib) or (iib), 
namely, that which Is simultaneously new and old information; 
consequently, 1 would not expect (ib) or (iib) to be possible sentences 
in any dialect of English (or in any language in which identical 
conditions are allowed by the grammar). 



since the pseudo-^clefted constituent, which contains mother in 



underlying representation, originates inside a complex HP (such as 
the circled one in (i:\56)). 

Notice that nen\.ences like (2.67b) and (2.67c) provide 
independent cupport^ for the claim I made in footnote 12 that all 
free relatives should be represented with genuine (i.e:, lexicaJ) 
heads in remote structure, for unless we make this assumption, neither 
(2.67b) nor (2.67c) can be ruled out by the CIIPC, and both vould 
necer.sitate ad lioc conditions. 

I take it that the arguments given so far demonstrate the 
superiority of (2.55>a) and (2,55b) over the Ross -Peters -Bach analysis 
of pseudo-cleft constructions. Now I address myself to the main point 
at issue, namely,, the interplay of the rule of Focus-Placement and 
the HRC. 

Let us first notice that Focus-Placement must be an unbounded 
rule, as sugnested J3y.,(2.69). - " ' 

(2.69) What I forgot to tell you that Mary insinuated 

that "Bill was prepared to instruct Greg to 
tell the FBI was that you intended to defect 
to Russia. 

Therefore, ^^iven Ross* formixlation of the RRC, Focus-Placement shotiid 
be an impossible rule, and this argument has in fact been used by Ross 
against all analyses of ppeudo-cleft constructions Involving extraction. 
Chomsky (1971) has replied to this objection by offerinr^ a slip;htly 
modified version of the RRC, which ve shall call the RRC.^, and which . 
stipulates that unboundeti rightwards movements are iingrojmnatical only 
if uhe essential variable they cross is non-null (this is equivalent 
to my (2.55c.i)). The RRC* undoubtedly rises' hi/::her in observational - 
adequacy than the RRC, for it. makes it possible both to derive sentences 
liKe (2.69) through extraction, an analysis which, as we saw, is to be 
preferred to the Ross-Bach-Peters one on independent r^rounds , and to 
block all the sentences which the RRC can block. To see this, ve need 
only examine some of the cases which supplied the motivation for the 
RRC and recognize that the xingrammaticality of the crucial constructions 
may be ambiguously attributed to the RRC or the the RRC*. Thus, 
consider the rules of Particle-Movement, Extraposition, Extraposition- 
from-NP and Right-Dislocation e -emplified in (2.70)-(2.73) respectively; 
the a-sentences arise when the corresponding rule has not applied^ 
the b-sentences when it has applied without crossing the bouHSary of 
the sentence immediately containing the moved constituent, and the c- 
sentences when it has applied arid crossed the boundary in question. 
Notice that in each 9^ the c- sentences, the moved constituent has 
crossed not merely an essential variable, but a n on-null one* 

(2.70) a. You testified that my brother had looked over 

the desk, although you knew that your/ 
testimony could send ^-im to jail. 
You testified that my brother had looked the 
desk over, although you knew that your 
testimony could send him to .jail. 
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(2.70) c. *You testified that my brother had looked the 
desk, although you knew that your testimony 
could send him to Jail, over. 

(2.71 ; a. That that John has disappeared is odd is 
surprising. 

b. ^That it is cdd that John has disappeared is 
surprising. 

r. *That it is odd is .surprising that John has 
disappeared • 

(2.72) a. That the claim that John is a traitor is 

unbelievable is odd. 

b. That the claim is txnbelievable that John is 

a traitor is odd. 

c. *That the claim is unbelievable is odd that 

John is a traitor. 

(2.73) a. That you have been annoying. ir^r sister is 

unbelievable . 

b. That you have been annoying her, my sister, is 

unbelievable . 

c. *That you have been annoying her is unbelievable, 

my sister. 



Despite its greater appeal, the RRC* itself cannot be correct, for 
it wongly predicts that (2.7^) is ungrammatical, This is so, if we 
accept the claim, which I defended in (.ll*) in connection with objection 
(g) to the IC, that the structure of the non-pseud.o-clefted counterpart 
of {2.7k) is esentialJy (2.75J. 

(2.7^*) What John drank at k o'clock was a glass of whiskey. 

(2.75) 




ADV 



John drank a glass of whiske7 at k o'clock' 

Indeed, the focus a glass r : vhiskey must cross the non-null at k 
o* clock which lies outside the boundaries of the boxed S. On tlie other 
hand, we would be ill-advised to reject the RRC' completely, since it 
correctly predicts the unacceptability of (2.76)-(2.78) . 

(2.76) ??What I told you John believes in on Monday 

morning at k o'clock is Cod* 

(2.77) 7*Wliat I tola you John drank at 3 o'clock oa 

Monday morning at k o'clock is a glass of whiskey. 

i 



(2.78) ^Vfhat I told you John drank in order to please his 

f^irl friend from Chicago at 3 o'clock oh Monday 
morning at U o'clock is a f^lass of vhiskey. 

xiow can v:e preserve all the adVantaces of the RRC' and at the seme 
time account for the acceptability of {2.^U) as well as for the {gradually 
increasing unacceptability of (2.76)-(2.78)? I propose to do this by 
taking into account not only the fact of a non-ntill instantiation of 
a variable, but also t he size and the nature of such an instantiation, 
as required by (2/55c.ii). " O/e shall call this proposal the RRC" . ) 

With respect to size , it shoiild be pointed out that sentences like 
^(2.7'*) become increasingly awkward in proportion^ to the length and 
complexity of adverbials, as shown by (2.79)-(2.80) . 

(2.79) ?V;hat John drank before our big fat neighbor 

living on Front Street and wearing a red beard 
came in is a glass of whiskey. 

(2.80) ??V/hat. John dr^k before oui^ bif: Tat neir:hbor who 

lives in Chicago and whose onl?/- daughter had an 
unfortunate love affair came in is a glass of 
whiskey. 

The awkwardness of sentences like (2.79) was ascribed by Bever to the 
already mentioned interruption principle (2.37): 

(2.37) Discontinuous -components are complex in proportion ^ 
to the structural complexity of the intervening 
material. 

(2.37) is a plausible principle, since, given an immediate memory*' with 
limited capacity, ve may expect that retention of the. first member of 
a discontinuity to be heard in proportion to the amount of processing 
that must go on before the remainder of that discontinuity is encountered 
and its first member can be discarded from immediate memory. 

Vfith respect to the nature of the instantiation of the pertinent 
variable, let us first notice that it must be taken into account in 
addition to considerations of size, since the size of the adverbial is 
the same in (2.7'*) and (2.76), but the two sentences 'differ sharply 
in acceptability. The crucial difference -can be appreciated by comparing 
(2.81) (a slightly modified version of (2.7^*) to make it similar to 
(2.76) in all respects except the crucial one) and (2.76). 

(2.81) What I believe John drank on Monday morning at 

h o'clock is whiskey. 

In (2.76), but nob in (2.8I), the adverbial forms a constituent with 
material which precedes the sentence from which the pseudo-clefted 
constituent was extracted. I suggest that the unacceptability of 
(2.76) is due to the fact that two interrupted sentences are continued 
separately, so that a hearer has to figure out which continuation is 
to be associated vith which previously interrupted sequence. I therefore 
propose the following tentative principle: 
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(2.82) T^v'o interrupted sentences are perceptually complex 
if one of them is at least in part internal 
(i.e., centers-embedded) to the other. 

It is easy to see that (2.82) can account for the unacceptability of 
all the c-sentences in (2.70)-(2.73) . For example, in (2.70c)/the 
interrupted t hat-clause is internal to the matrix sentence which the 
that-clause itself interrupts; the difficulty results from the fact 
that both sentences have been interrupted after desk, and from the 
fact that the although- clause must be associated with y ou testified , 
v:hile oyer must be associated with the that--dlause. ' An independent 



^^The rules illustrated in (2.70)-(2;73) "also provide evidence 
against both the RRC and the RRC» and in favor of the RRC", for the 
limitations on their scope are not in fact identical, as-^Ross thought. 
Thu^. in structures like (2.75), the scope of Particle-I^ovement is 
indeed the boxed S, but the scope- of Extraposition is. the circled H, 
as shown in (i) and (ii) respectively. 

(i) a. That John may die was cmnounced^ .at noon. 

b. It was announced at noon that John may die. 

(ii) a. John looked over the desk at noon. 

b. John looked the desk over at noon. 

c. *John looked the desk at noon over. 

The restrictions on Particle-Movement are reminiscent of the 
situation found in connection with question phrase movements in 
Japanese: "the RRC" has been both grammatized and strengthened. 
Extraposition provides an illustration of the intermediate situation 
in which the RRC" has been grammatized, without strengthening. 

^{ ' 

virtue of (2.82) is that it can account for the complexity of center- 
embedded sentences to a degree higher. than one, such as (2.83). 

( 2 . 83 ) 2^ Kirl th e_teacher t hat the s^chool fired 
fYtj^eji crXed^. ^ ^ 

The tripiy- and doubly-underscored sentences are both ^interrupted,, 
and the doubly underscored one is •at least in part internal' to the 
triply-underscored one; as in the previously considered examples, the 
difficulty is mainly due to the fact that flunked and cried must be 
paired with the appropriate distinct sequences. 

TZ, 19 



18. 



proposal is similar, but not identical, to the following 
explanation of the complexity of sentences like (2>83) proposed in 
Chomsky and Miller (1963) and Chomsky (1965)^: 

(i) A device cannot interrupt a given procedure more 
than once in order to use that very procedure. 



.1 



ERJC 



I believe (2.82) to be superior to (i) in two respects: 

(a) (2.82), but not (i), can account for ^both (2.83) and for 
(.2.76)-(2*77) (as well as for (2*70)-(2.73c) ) . This is so because 
(i) depends crucially on their existing a third sequence , such as the 
Gin/?;ly-underGCored one in (2.83), which must moreover be of the same 
kind as the/two interrupted sequences, while (2.82) merely requires 
that two distinct interrupted sequences be s eparately continued; 

(b) As Bever has pointed out, it seems ad hoc to propo,se that 
one interruption of a procedure by itself ii perfectly acceptable, 
while two such interruptions are totally unacceptable; this criticism 
does not apply to (2.82), since it is irrelevant 'whether the two- 
sequences are interrupted' by a third one, as in (2.83),, or by each other, 
as in (2.76). It is interesting to note that the alternative explanation 
•proposed by Bever, which I reproduce as (ii) below ^ cannot explain all 
the facts about* centerembedding either, even though it may be able to 
explain some of the facts.' 

(iiy A stimulus may not be perceived as simultaneously 
having two positions on the same^classificatory 
dimension. 

Indeed, Bever blames the unacceptability of (2.83) entirely on the 
* double- functi on • of flu nked ; notice, however, that (iii)^and (iv), 
in which the underscored elements also have double-function, are 
infinitely better than (2.77) . . 

(iii) The buckles of the collars of these coats are strong, 
(iv) John's f at her * s brother is sick. 



^Notice that the requirement that the two sentences be se parately 
continued (which follows from (2.82), can explain the acceptability of 
sentences like (ii) below, which is derived from (i) throuf^h 
Coordination-Reduction, in violation of the RRC and of the RRC*. 

(i) I told Mary that John bought the cake, and I also 
told her that he later ate the cake, 
(ii) I told Mary that John bought, and I also told her 
that he later ate, the cake. 

Given an analysis of the Conjunction-Reduction which cbpies t he ca ke 
simultaneously out the two conjuncts and Chomsky adjoins it to the 
coordinate node, the copying of t he cake out of *the second conjunct 
violates the RRC and its copying out of the first conjunct violates 
the RRC*. However, ti-.e derivation of , (ii) from (i.) does not violate 
(2. 02), since the second conjunct is not; internal to* the first; in 
fact, both discontinuities are resolved s imultaneously by t he cake ^ 
rather than s eparately , as in (2.76) or 12.83). 



If the unacceptability of (2.76) car. ,be attributed to (2.82) /the 
greater unacceptability of (2.77) arid (2.78) can be attributed to the 
combined effect of (2.82) and (2.37). Indeed, in (2.77) and (2.78) 
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the instantiation of the pertinent variable is both longer and more 
coini^lex than in (2 ♦76), since it must be segmented into at least two 
part5^ each of which must be paired "with different seauGrices.20 



There are speaker's who reject ail the seni.ehces in (2.76)- 
(2.73)^lindiscriminately* -consider tHat (2.82) has been grammatized 
in theirvdialects. 



I take it that the arguments I have advanced in support of (2 .37) 
and (2.62) have shown that the .RRC" (i.e., (2, 55c .1) arid " (2.55c .ii) 
v;ith size and nature .interpreted in the sense of (2.37) and- (2.82) 
respectively) is superior- to both the- RRC and the RRC' as an acco\mt 
of the Right" Roof Constraint. But now we he.ve an additional reason 
for adopting (2.55a), (2.55b) and rejecting the Ross-Peters-Bach analysis. 
This is so, not because the former involves :rv Ovement of the pseiido- 
clef ted phrase (indeed, -the behavioral posit5.on does not depend on 
there being a transformation' involved — since (2.82):, for example, is* 
applicable to structures like (2.83), where the pertinent discontinuities 
do not result from the application of a transformation); rather this is 
so because, given a representation Tike (2s61*), the first occurrence 
of J ohn ate and an apple do not^ .form a discontinuous component. 21 That 



My claim that the marginal status of sentences like (2.76) is 
due to the existence of a perceptually complex discontinuity makes it 
irrelevant — at the explanatory level — whether Focus-Placement proceeds 
in one swoop or by cyclic hopping. 



they should be represented as forming a discontinuous component is 
also shown by the difference in acceptability between (2.0Ua) and 
(2.8i+b), which differ only in that the former is a pseudo-cleft sentence 
while the latter is not. 

(2.8J+) a,*-2-?\/hat I safd that John- likes most in front of 

all the people who had assembled in front of 
his house is a bottle .of good French wine* 
b. What I said that John likes most in front of 

all the people who had assembled in front of 
his house is no concern of yours. 

Before concluding this section, an evaluation of the merits and 
demerits of the particular analysis of pseudo-clefts I have proposed 
is in order. 

The underlying representation I propose for (2.5^a), namely, 
something like (i:.85)~(See Appendix One), has the disadvantage of 
allowing empty no^^es in underlying representation. 




Presupposition: jfpHn ate sonething. . 

Empty nodes have been extensively used by Eminpnds in his dissertation 
and in other- papers in an , effort to show that certain transformations: 
are structure preserving (i.e., they do not create more structure). 
To {jive hls-Hypothesis empirical content, Emmonds requires that empty 
nodes be allowed only where a non-renpty constituent of the same type 
can be generated by the base rules (empty nodes carry categorical 
labels and are subject to subcategoriz;ation) . The empirical test of 
the hypothesis^ is that constituents .should not be movable to a 
position where a constituent of the. same type cannot be generated in. 
the base. Easing himself on this assumption, Emmpnds (l969b) presents 
a rather ingenious argument for therej insertion into an empty node. 
"He argues that be usually require a subject and a complement^ as in 
my iincle is very old ^ but that with indefinite subjects, it looks like 
the complement, Liay .be Tnissing, as in ^a huge fire was , in which case 
t her e- in s ert i on is obligatory and yields there was a huge fir e. 
Emmonds proposes a 'simpler' solution for the, latter case: a huge 
fire is the complement rather than the subject, the subject node is 
generated empty by the" base, and there is. inserted into it. The 
proldem with" this p.nalysis is that it implies that the form be 
occurring in these two examples- is the same v erb. However, it is 
clear that in the former case, be^ is a two-place predicate asserting 
a relation between my uncle and the property of being very old, while 
in the latter, it is a one-place predicate asserting the existence 
of a liuge fire.. Consequently, the generation of an empty node for 
there is quite ad hoc in this case and with no semantic justification 
(I have criticized Emmonds' principle for its unnaturalness, while 
assuming it can do the job it is supposed to do; however, Emmonds' 
treatment of there-Insertion is inadequate on observational grounds, 
for while the one-place predicate b£ usually requires there-Insertion , 
there are a few expressions in which it does not, such as G od is , 
or I think, therefore I am , which Emmonds would be forced to claim 
have an empty node in surface structure and are therefore ill-formed). 

A more restricted proposal involving empty nodes would require 
that empty nodes should be allowed only if they, together with the 
non-empty arguments, add up exactly tc the number of arguments that 
the pertinent predicate allows on logical grounds. This .requirement 
is not violated in (2.85), for be is a two-place predicate in pseudo- 
cleft construction, and empty nodes may ^be allowed with the provision 
stated above, when their presence has considerable independent 
advantages.^ I review and evaluate these advantages belovr. 



- 97 - 

First, the fact that the pseudo-cleft phrase originates inside 
the circled S-node. in (2.85) enables us to account for the fact that 
its selectional restrictions are determij^(ed^<'by the verb of which it 
is 8Ln argiiment in the non-pseudo-cleft counterpart. However, J. 
Gundel has pointed*lDut to ne that an alternative account is possible. 
Thus, the copy which the extracted phrase was claimed to leave behind 
is coreferential with the head of the relative clause; as the head 
cau enter into the same selectioned restrictions as the relative 
pronoiin it attracts (since they are coreferential) and as the pseudo- 
cleft construction asserts coreferentiality between the referent of 
the head (further identified 13y the relative clause) and the more 
specific predicate NP, it is not surprising that the latter can take 
part in the same selectional restrictions as 'the less -specific head 
of the relative clause, and therefore- of the copy which is ultimately 
relativized; Therefore, extraction is- not the only possible solution 
here. 

Second, the extraction, analysis ^,accounts for the island character 
of cleft and pseudo-cleft phrases. I do not. see' what alternatives are 
available, but P. Schachter (personeG. communication) has pointed out 
to .me that some hon-pseudo-cieft copular constructions exhibit the 
same properties. Thus, the, doubly-underscored element in the discovery 
was that John likes Mary cannot be questioned, as. ^ who do you think 
the discovery was that John likes is Just as bad as (2.67c). True, 
most copular subjects which disallow movements out of the predicate 
are themselves paraphrasable by a sentence (thus, a more remote 
representation which Generative Semantics wouldassigh to the discovery 
was that . John likes, Mary might be something like the .thin^ which was 
discovered was that Johh likes .Mary ) . However, the claim that *who 
did you say the discovery was that John likes is bad by the CNPP would 
be somewhat contrived. 

Third, the extraction analysis makes it possible to say that 
the pseudo-cleft phrase and some sentence in the subject clause form 
a discontinuous component.. It may legitimately be asked- whether 
extraction is the only way of expressing a discontinuity. Personally^ 
I do not know of a viable alternative, but I would undoubtedly welcome , 
one which would do the same work as my analysis and would hot need 
to employ empty nodes. I would, however , like to point out that 
movement or copying trcms formations have already been used in the 
literature to> express the fact that a structiiral discontinuity exists, 
even though those cases involved exactly the same kind of difficulty 
as my account of pseudo-clefts with respeqt to underlying representation. 
For example, Ross (19^7) posited a rule of Right-Dislocation to 
account for sentences containing after-thoughts, such as (2.73b); 
specifically, Ross proposed that (3.73b) is derived from its •non- 
dislocated counterpart* , (2.73a). That right-dislocated structures, 
involve a discontinuity is strongly suggested by the fact that they 
obey the RRC (see (2.73c)), as well as by the fact—not mentioned by 
Ross — that they allow (apparent) violations of the command-requirement, 
as far as feature-changing is -concerned, in exactly the seine way as 
pseudo-clefts do. Thus, in parallel manner to sentences like (2.59a) 
or (2.59c) there are sentences like I don^t believe it. that Mary ever 
hanned anyone or she did it. compromise herself irrevocably respectively. 



The difficulty is that the main clause and the afterthought can 
be separated by such items as" I mean , namely ,- that is , etc.. , and 
it is not possible to regard I don ,*.t believe it, nsiinely, that Mary 
e ver harmed anyone to be derived from don't believe that Mary 
ever harmed anyone, namely , for such a representation is semantically 
ill-formed^ as it makes the implicit claim that na mely is .a one-place 
predicate. It seems to me that it would be more acceptable to regard 
the source of sentences containing nai.iely as having the latter as 
a two-place predicate, one argmnent being the • non-dislocated 
counterpart* and the other being empty; the dislocated phrase can . . 
then be copied into the empty node by Right-Dislocation ,^ just as the 
pseudo-cleft phrase is copied into an empty node by Focus-Placement. 
As in the case of pseudo-clefts, empty nodes" strike me as more 
tolerable than no nodes at alii ^^^^ 

Fpurth, the extraction analysis makes possible a straight fonmrd 
account of the behavior of all feature-changing processes, as 
shown in (2 .58)-(2 ,62) , and this seems to be the strongest argument 
in its favor. The fact that all the feature-changing processes 
behave as if the neutralizee were commanded by the neutralizer cannot 
be accounted for by an analysis which- takes the underlying form 
of pseudo-clefts to be essentially their surface form; it can, of 
course, be accounted for by the Peters-Bach-Ross analysis, but 
that analysis postulates an incoherent underlying structure, while 
mine merely involves unnecessary material. Consequently, until a/ _i 
superior analysis of pseudo-clefts is proposed, I believe the extraction 
one can stand. 

Regardless of these theoretical considerations, ray fundamental 
concern in this section has been to- show that the interplay of Focus- 
Placement and the RRC" can be handled adequately only within the 
Behavioral Position, which predicts that the derivations in which the 
RRC" is violated can be rated on a gradual scale of unacceptability 
in a principled way. The Strong Transformational Position fails with 
respect to Focus-Placement, for it predicts that there are no pseudo- 
cleft sentences in English. The Weak Transformational Position, 
which predicts that the derivations in which a constraint is violated 
are all ungrammatical, seems to be supported by those dialects which 
indiscriminately reject (2.76)-(2«78) (see footnote 20), but cannot 
account for the majority dialects. On the other hand, the Behavioral 
Position can account for the dialects described in footnote 21, by 
assuming the grammatization of (2. 82). 22 



^^As I pointed out in the Introduction, the problem of graimnati- 
zation cannot be seriously considered before the significance of all 
the pertinent variables is known. The problem might be even more 
complicated, for grammatization may depend not only on the cognitive 
endowment of language learners, but also on their motivations; in 
other words, we may need not only axi explicit theory of cognition, 
but also explicit theories of thought, behavior^ and human interaction — 
still a remote goal at the moment. If, however, I may allow myself 
to speculate on some motivational factors with respect to the cases 
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considered in this section, it seems to me- that a not implausible 
guess as to the failure of the RRC" to grammatize Xn tpseudo-cleft 
constructions is that the likelihood of occurrence of pseudo-cleft 
sentences which violate the RRC^^ as compared with those which do not 
is relatively small. Moreover, sentences like (2.76)-(2,78) have 
stylistic variants which do not violate the RRC'^; for example, (2.78) 
can be rendered by the quite acceptable (i). 

(i) V/hat I told you on Monday morning at h o^clock that 
John drank at 3 o'clock in order to please his 
girlfriend from Chicago is a glass of whiskey. 

In contrast, if question-movements had to occur in SOV languages, the 
RRC^^ would probably be violated in a large number of derivations, and 
consequently a large number of questions would be marginal or totally 
ungrcumnaticed, since the material. over which the question phrase would 
move would have to contain verbs (as shown in (iiO), as there are no 
stylistic variants in which these verbs can- be fronted, for the verb 
is necessarily fined in Japanese. 

(ii) V7atakusi-wa Biru-wa dare-ga suki ta to Mary-wa 
I Bill who like Mary 

shinjite iru to John-wa utagatte iru no ka to kitta. 
believes John suspects asked 

•l asked who John suspects Mary believes Bill likes. ' 

s 

One objection which could be raised against this acco\mt is that, 
so far, no SOV language has been found in which question-movements 
operate, and if the RRC" is indeed a perceptual principle, we may expect 
that there should be SOV languages in which question-movements operate 
freely from the sentence immediately below the question verb, and 
with increasing awkwardness ap we question from deeper levels of 
embedding. While such an objection cajinot be passed over lightly, it 
should be pointed out thajb the predicted likelihood of finding question- 
movements in an SOV language is very small, as there is really no 
pressure in this direction. Indeed, so long as the question phrase 
is overtly identifiable, there is no stringent need to move it. Thus, 
there are dialects of English in which question-movements are optional, 
that is, i*n which (iia) is interpret able as a genuine request for 
information; moreover, question verbs can govern more than one wh-word^ 
only one of which is allowed to move and still sentences with more 
wh-words than question verbs are perfectly interpret able, as shown 
by (iiib). 

(iii) a. You were talking to whom? 
b, I wonder who saw what. 

The assumption that motivational criteria like the one proposed 
above can determine grammatization is supported by the behavior or 
Relativization in Japanese. This rule is subject to the CNPC, the 
CSC and the Crossover Constraint (Ross (1967)); unlike the Japanese 
rul"e of Question-Formation (analogous to wh-placement ) , which is subject 
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to none of these constreiints . As Relativization obeys most of the 
constraints on movement rules, we would expect it to obey the RRC", 
but it does not, as the acceptability of (iv) shows. 

(iv) Biru--ga suki ta to Mary-ga shinjite iru to John-ga 
Bill likes Mary believes John 

utagatte iru ohna no kb wa kiree da. 
suspects girl pretty. ^ 

'The girl who John suspjects Mary believes Bill 
likes is pretty. * 

This suggests that constraints can, be violated when the functional 
motivations for doing so are strong enough, for if Relativization were 
also subject to the RRC", there would hardly be any relative clauses 
in Japeuiese, and apparently a language cannot easily. get by vithout 
relative clauses, (iv) also supports my hypothesis that the RRC" is 
a behavioral principle, for it is conceivable that a behavioral 
principle can be violated, but it is hardly possible to claim that a 
universal syntactic constraint sometimes holds and sometimes does not . 

Concerning the rules illustrated in (2.70)-(2.73) , we may consider 
some possible reasons for their grammatization. Notice that in the 
case of the pseudo-cleft -construction and of Coordination~Reduction 
to the right, there are reasons for the dislodged constituent to rise 
to a certain height: the pseudo-cleft phrase must be a clause-mate of 
the head of the subject relative clause, and the reduced constituent 
must be lifted high enough to escape from "the dominatioh of the 
coordinating conjunction ( and or or). No such motivations exist 
with respect to the rules exhibited in (2.70)-(2.73), and we find no 
movement everi when the extracted constituents would cross a null- 
variable. 

In addition to considerations of double-interruption, there may- 
be additional factors contributing to the unacceptability of the c- 
sentences in (2.70)-(2.73) . Thus, consider the case when the 
remainder of the sentence form which some element has been displaced 
to the right by a non-constituent, as in (2.7bc)* This wovdd force 
the hearer to keep in mind a non-constituent, presumably a more 
difficult task than the retention of a constituent (this has, I believe, 
been shown experimentally); in contrast, since rules do not move non- 
constituents, unbounded movements to the left could never force the 
retention of a non-constituent, and this difference in the effects 
of movements might contribute to the limitation of rightward movements 
more severely than of left-ward movements. 

On the other hand, Particle-Movement does not necessarily leave 
behind strings which look like npn-constituents, as for example in 
<^that you claim that Bill had seen Mary is odd through ; in this situation 
ve may reasonably assume that the closure strategy (2.8) increases the 
complexity of this sentence, for the occurrence of the new predicate 
is after Mary presumably constitutes a sufficient encouragercent for 
the hearer to regard the strong Bill had seen Mary as a complete 
sentence. The same situation favoring erroneous closure exists in 
(2.72c), and probably even in (2.71c). With respect to (2.73c), its 
unacceptability may look puzzling, for antecedents (more correctly. 
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postcedents) can usually come arbitrarily frr after pronouns. However, 
we are not dealing here with an" ordinary r icedcnt, but rather with 
a tag expressing an after-thought, like I mean^ my sister . Hi^tlce 
that such tags are awkward when delayed even if they have not been 
lifted into a higher clause, as shown by the unacceptability of * youWe 
been pursuing her, although I explicitly forbade yoc to get out of 
your room, (I mean,) ay sister , in which the ^ although clause is an 
inserted parenthetical. Contrast this with clauses moved by Extra- 
position, which allow the insertion of long patentheticals: It is 
surprising to me^ although I might have expected it all along, that 
Jo hn has left with another woman . The general conclusion we may draw 
is that erroneous closure is much more likely when elements move to 
the right than to the left, for leftward displaced elements usually 
carry a distinguishing mark which alerts the hearer to the fact ,that 
a discontinuity exists. Such closure is, however, less likely in the 
highly characteristic pseudo-cleft construction than in sentences 
like those in (2.7l)-(2.73) . 



2.2.0. This section discusses various 'minimal distance* 
principles which have been often claimed 'to exist in order to reduce 
the amount of ambiguity* that would otherwise occur in natural 
languages. I believe that, put in this way, the claim is misleading, 
since it suggests a teleological orientation of speakers aimed at 
-minimizing ambiguity; if speakers really followed a strategy like 
•reduce ambiguity whenever possible,* it is difficult to see vhy they 
.should allow ambiguity at all. I prefer to think that there are 
principles which have the indirect effect of reducing -ambiguity in 
certain cases* One such principle could be (2,22), which prompts 
speakers to select, in the absence of indications to the contrary, 
the simplest reading of an ambiguous utterance as the most likely; 
whenever grammatization occurs (marking the less likely reading as 
ill-formed), the result is a reduction in potential ambiguity .j Cne 
(usually) grammatized instance wafi seen in connectionjwith ,the\ cross- 
over restrictions on Relativization and Topicalization; some non- 
grammatized cases are considered below. 

It will also be argued that in cases of equal likelihood, 
ambiguity is possible only when the potential readings are, in a 
sense ill-understood at present, minimally 'distinct.* For example, 
the sentence John has a dog^ Mary has a cat^ and Bill has one too is 
impossible, for the hearer 'has no basis on which to select one of 
the (at least) two potential antecedents of one; as no reading is 
more likely than the other, both are impossible. 

Granted that reduction of ambiguity is not a primary goal of 
speakers but an indirect effect of independent interacting principles, 
it still looks 'intuitively reasonable* to expect that minimal distance 
might increase the likelihood of one 6\it of two or more potential 
readings. In fact, a linear minimal distance principle might even 
appear to derive experimental support from such well-attested facts 
of perception as the recency and primacy effects (iihlch say that the 
first and last elements of a sequence are more easily remembered than 
the remaining ones). Such a claim woxHd, however, be fallacious for 
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the closest pertinent element (for example, the most recent of a 
number of possible antecedents) need*not be the last element heard 
before th e pro^form ; therefore, the recency effect does not support 
a linear minimal ih ;ance principle in general > ^3 in fact,' I will 



i?here is one case in which the recency or primacy effect 
seems to be involved. Thus, a question like would you like milk , 
tea, or coffee? can be answered jres in some languages, with the meaning 
of either milk or coffee (but not both for the some language) • The 
meaning milk seems to be possible in a number of American dialects, 
while the meaning coffee is possible in Burmese (I owe this piece of 
information to A. Zwicky); I do not know of any language in w hich the 
answer jres to the above question can mean tea , and if none in effect 
exist,. this would follow from the non-existence of a strategy which 
selects a medial element in a sequence. 



attempt to show below that whenever a minimal distance principle seems 
to be operative, there exists a more plausible alternative explanation • 

2.2.1.1. One of the most simplistic proposals involving a 
minimal distcmce principle can be found in some traditional textbooks 
which lay down that if a pronoun has severauL possible antecedents, 
the actual antecedent will be the closest to the pronoim, in terms 
of linear distance. That his claim is empirically false is shown by 
the ambiguity of (2.86). 

(2.87) John told Bill that he had become a father. 

(2.66) shows clearly that the prescriptive pronounceiaent referred 
to above is not a rule of English. . In fact, the choice of the closer 
antecedent is not even more likely, which suggests that the choice 
of a remote antecedent creates* no perceptual problems in principle; 
in other words, the cmtecedent pronoun relation does not seem to be 
subject to the usual perceptual constraints on discontinuities (such 
as (2.37)). 

That choosing the more remote antecedent in the case like (2.86) 
is not perceptually complex is also suggested by the fact that there 
is apparently no limit to the amount of discpurse that can separQ.te 
a pronoun from its antecedent, so long as the topic of the discussion 
has not changed radically in-between.. Imagine, for example, one girl 
asking another how she had spent her honeymoon with John in Paris, 
and the newly married one launching into a description -of the beauties 
of Paris, of the magnificence of the Bois-de-Boulogne, of the splendor 
of the Jardin-des-Tuileries, which she woidd conclude with the 
sentence: a nd in front of the statue of Louis XIV at Versallies^ he 
kissed me passionately . It seems to me that even if the account 
contained no reference J;o John, there is still no difficulty whatsoever 
in associating he^ in the above xmderlined sentence with John. The 
hypothesis that establishing a connection between tw NP's'is not 
complex in proportion to the amount of intervening discourse will 
play an important part in my attempt to account for what I called in 
Chapter One the Dichotomois Behavior Principle. 
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2,2.1.2. My clain that linear distance plays no part in 
resolving ambiguities in Standard English might seeoi somewhixt 
STirprising, in view of the important part played by surface order 
in signaling, the roles of various .arguments . However, fhere is no 
mystery here: surface order is perceptually important in role- 
discovery, and in nothing else. . "^reover, it is only important to 
the exten t that it contributes to role-discovery . The latter point 
can be illustrated with an interesting observation of Lakoff 's (l970c) 
to the effect that potential ambiguities are sometimes unacceptable, 
unless there is some feature with disambiguating effect. Thus he 
points out that ambiguity is not tolerated in (a.e^b), although it 
is tolerated in (2.87a), and assumes that the distinction is due to 
the configurational peculiarities of the formtr. Moreover, these 
configurational factors do not lead to unaccept&bility in the presence 
of a disambiguating feature, like the fact that Karv is C-MALED in 
{2.87c}. 

(2.87) a. John told Bill that he had won the Grand 
Prize. 

b. *John and Bill walked in, and he took off his 

coat. 

c. John and Mary walked in, and hi took off his 

coat. 

It seems that for ambiguity to be tolerated,, the various readings 
must achieve a minimal level of » distinctness. » Distinctness, in this 
sense, is an empirically discoverable notion, not one understood in 
advance, and undoubtedly deserves to serve the objecc of a separate 
study. It is sufficient, however, for our purposes to notice that 
the different surface ordering of the two putative antecedents of he 
assigns them different roles in (2.87a), while nothing of the kindTs 
the- case in (2.87b). Instead, surface order is irrelevant in a 
coordination, for, whatever the » roles » of coordinate terms (if on« 
f :cepts the Generative Semantics claim that logical and and or are 
predicates), these are clearly identical; moreover, coordinate terms 
are commutative. It would seem, therefore, that role-distinctness 
is required'^'* in order for ambiguity to be allowed in such instances. 



In the absence of disambiguating information, role-distinctness 
between the two putative antecedents is a necessary, 'but not a 
sufficient, condition for ambiguity to be possible. An exhaustive 
study of this sufficiency condition is beyond tht scope of this 
thesis, but the paradigm belov suggests that (2.87b) is unacceptable 
for at least two reasons, namely (a) that the path connecting the 
antecedents goes through a coordinate node, and (b) that there is 
some node which dominates both antecedents and which is connected 
with the pronoun by a path which traverses a coordinate node. 

(i) «John and Bill decided that he must go. 
(ii) "John saw Bill lying on a sofa, and he dropped dead 
soon after. 

(iii) *That Bill loves Mary and that Joan loves Jack seems 
odd to him. 
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(iv) Johni spoke to Bill< and to I ^^^i > mother, 
(v) That John^ and I Jj^^i 1- mother were sick surprised 



Billj 




(vi) Billi is sick, but neither John^ nor r vife are. 

VTiatever the explanation for conditions (a) and (b) above, (b) is the 
same condition which prevents a discourse-boundary (like full stop, 
question mark, etc.) to intervene between the two antecedents on the 
one hand and the pronoun on the other , although such a boundary is 
allowed between the antecedents, as shown by (vii) and (viii) 
respectively. 

(yii) a. Has John spoken to Bill about the impending 
bankruptcy? 
b. *Yes, and his mother was most upset. 

(viii) a. Does John^ know about the bankruptcy? 

b* Yes, and Bill* has told /^^^i mother too. 
^ thisjj 



and the different surface order of John a nd Bill in (2.87b) cannot 
provide the kind of distinctness required for ambiguity, precisely 
because it is not semantically or role-wise significant. This 
supports my earlier claim that linear distance in itself is irrelevant . 

I believe that the solution to the unacceptability of (2.87b) 
should be sought by investigating the notion of 'distinctness* I 
proposed above, and not by imposing a transderi rational constraint, 
as Lakoff proposes. Lakoff's solution is,^ I think, inadequate, because 
the problem is not statable In pui-ely grammatical terms. Thus, the 
trans-derivational constraint should be formulated in such a way as 
to allow all cases where there is a disambiguating factor, as in 
(2.87c), and this factor need not be strictly linguistic. For 
example, (2.88a) and (2.88b) seem to me perfectly acceptable, although 
the disambiguation is effected by extra-linguistic information; this 
information is context -supplied in the former case, and presupposed 
by the speaker in the latter case. 



(2.88) a. John and Bill walked into and out of the room 
where .1 was sleeping respectlvely^~~and then 
he sErUclTme a big blow on the head, 
b. Mao-Tse-Tung and Jack had a nice chat together, 
and he started boasting he had made China 
th^ most powerful nation on earth. 



I do not see how a grammar could have access to the disambiguating 
.information in cases like (2.88). In (2.88b), for example, the 
grammar would have to be able to check whether what is asserted of 
the pronoun is true of both putative antecedents, and this would 
require the ascription to Mao-Tse-Tung of a feature like C+BOASTS 
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HE! HAS MADE CHINA THE MOST POV/ERFUL NATIOII OH EARTH3! 

2.2.1.3. In section 1.3.3/1, I mentioned .Chonsky^s claim that 
•condition (1.102. i)* (vhich I reproduce below for convenience) •in 
some cases, has the effect- of reducing ambiguity, or to put it 
differently, of increasing the reliability ' of a' reasonable performance 
strategy' which seeks the nearest NP to a verb (or the head noun of 
a nominal phrase) as its subject. • 

(1.102) i. No rule can involve X, Y in 

...X...C ...Z...-mfV...3... 

where (i) Z_ is the subject of WYV and is 
not controlled by a category containing X. 

The cases of possible ambiguity which Chomsky has in nind involve 
a transformation proposed in Dougherty (1970) which derives (2.89a) 
from (2.89c) through C2.89b)._ 

(2.89) a. The men hated each other. 

b. The men each hated the other(s)'. 

c. Each of the men hated the other(s). 

Chomsky claims that each- movement must sometimes operate across 
sentence-boundaries, specifically in those cases where the subject of 
the embedded clause is a PRO-form controlled by a category in the 
matrix clause. Thus, he requires that (2.90a) be derived from (2.90b) 
and not ftrom (2.90c), on grounds of synonymy. 

(2.90) a. We want to kill each other. 

b. Each of us wanted to kill the -other(s). 

c. We wanted each of us to kill the others. 

Condition (I.102.i) serves to prevent the derivation of the b-sentence 
from the a-sentences in (2.91), where Z is not controlled by anything, 
being non-specific, in (2.92), where Z is controlled by the wrong 
category, and in (2.93), where ^ is a specified non-controlled NP. 

(2.91) a. Each of us heard about plans to kill the other(s). 
b. *We heard about plans to kill each other. 

(2.92) a. Each of us ordered John to kill the other (s). 
b. *We ordered John to kill each other. 

(2.93) a. Each of us expected the 3oldier(s) to shoot 

the other(s) . 

b. *V;e expected the soldier to shoot each other. 

c. We expected the soldiers to shoot each other. 

(2.93c) is one of the cases Chomsky claims would be ambiguous without 
(l.l02.i), since it could then be interpreted as derived either from 
(2.93a) or from vb expected each of the soldiers to shoob the other (s) . 
In that case, one might claim that there is a minimal distance strategy 
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which, at least in paurt, caused the introduction of (l.l02.i) into the 
grammar. This would contradict my earlier proposal that minimal 
distance is not an operative criterion in English. However, I see 
some rather serious problems with such a prpposaJ.. ' 

First, one would have to explain why the strategy was granmiatized 
not just in (2.93c), but also in (2.93b), where no risk of ambiguity 
exists. Rule-generalization arguments seem to me rather forced in 
this instance. 

Second, as Chomsky himself notes in his note 31, the input and 
output to the each- movement rule are not always synonymous, since they 
shot each other does not imply each of them shot the other (s) » although 
they. hate each other does imply each,:of them hates the other (s) . 
Similarly, it seems to me that we promised to respect each other is 
consistent with a situation in which one person performed an act of 
promise on behalf of a group, while we each promised to respect the 
other(s ) implies that each member of the group promised individually. 
The lack of s:/Tionymy in some cases does not argue against the each- 
movement rule, since transformations are allowed to change meaning in 
Chomsky's Extended Standard Theory, but it does weaken the motivation^ 
for a minimal distance strategy, by reducing the number of possible 
ambiguities. 

Third, I find the rule of each- movement itself doubtful, as I 
ccmnot think what the source of sentences like those in (2.9^) would 
be, if each must originate as a quantifier of some NP to the left bf 
its surface position. 

(2.9^) a. John talked to Bill about killing each other, 
b. Bob showed Jill a picture of each other. 

It seems to me that both the each- movement and condition (l.l02.i) can 
be dispensed with, and that all we need to do is Id allow the generation 
of each other in its siurface position, with the requirement that the 
expressed or understood subject of the clause each other is it be a 
plural or a collective. This would automatically explain the un- 
grammaticality of (2.91b), since unspecified subjects are not necessarily 
understood as plurals; in fact» if Unspecified Subject is understood 
as someone , it will be a singular. 

2.2.2. A more sophisticated minimal distance principle, based 
not on linear distance, but on path-length measured along tree-branches, 
was proposed in Rosenbaum (196?) in order to determine the unique 
antecedent of a deleted complement subject. This principle was, 
however, convincingly argued not to exist in Postal (1968b)., where it 
was shown that deleted ^complement subjects have no unique antecedents 
in general, as shown in'^(2.95a), and that antecedent-uniqueness was 
determined by independent factors in the subset of cases where it was 
found. ^5 Moreover » even when the emtecedent is unique, the minimal 



^^A natural characterization of this subset was attempted in 
Grosu (1970). 
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distance principle does not always correctly predict it, as demonstrated 
oy the contrast between {2-95b) and (2.95c)~a ininimal pair discovered 
by Jespersen. 

(2.95) a* John talked to Bill about kissing Bertha, 

b. He allowed her to go. 

c. Ke pr6mised her to go. 

2.2.3. In sections 2.2.1 and '2.2.2 I argued against .a strategy 
making use of linear distance cr of distance measured along^tree 
branches. Other minimal distance principles could be devised, and in 
this section, I shall inquire whether there is a principle that resolves 
ambiguities in terms of node-height. Consider, for example, (2.96), 
where the underlined constituent has the structure exhibited in (2.97) • 

(2.96) This is the cat with three kdt^eris . which caught 

a lot of mice lasr week. 

(2.97) 




The cat with three kittens 



The non-restrictive relative clause could modify either t hree kittens 
or the cat wi th three kittens . In neither case wotild linear distance 
make a difference, for the relative clause is adjacent to both putative 
heads. Neither would branch-counting .prefer one reading, for the' same 
number of branches would connect the relative clause to its head, 
regardless of whether the latter is NPi or NP2. However, NP^ is higher 
up on the tree, and one may conceive of a strategy associating the 
relative clause with the lower node NPg. As far as I have been able 
to ascertain, (2.96) is ambiguous and offers no support for such a 
strategy. 

2.2.4.1. I shall now consider a number of cases in which linear 
distance appears to effect disambiguation. Perlmutter, at the 1970 
Conference on English Syntax, proposed a tr?^nsderivatibnal constraint 
to handle the unambiguity of cases like (2.98a), which most people 
cannot perceive as a paraphrase of (2.98b). 

(2.98) a. A woman hit a girl who was pregnant. 

b. A woman who was pregnant hit a girl. 

c. A woman Just left who was pregnant. 

d^. A woman hit the curb who was pregnant. 

Thus, although one would expect the relative clause in (2.98a) to he 
perceived as having arisen in one of two different ways, i.e., by being 
generated as a modifier of a girl or by Extraposition-from-NP, the 
latter reading seems to be cut. An examination of the entire paradigm 
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in (2.98a) will quickly reveal that the reading involving Extraposition- 
from-NP is out just in case ambiguity could arise*. Perlmutter's 
transderivational constraint, was formulated roughly as follows: 

(2.99) A derivation D is^ ill -formed if Extraposition-from- 

NP applies in D, and df , by virtue of this, the 
last line of D becomes stringwise identical with 
7 . the last line of D' , where D and D' differ in 
semantic representation. 

I do not believe that a transderivational constraint is the 
correct solution in this case, because there are dialects (e.g., A. 
Zwicky's) in which (2.98a) is ambiguous, but, in. which the readin/^ 
ascribing pregnancy to* the prjrl is far more likely . The greater likeii-- 
nood of^one reading cannot be captured in the transderivational 
constraint, which can either allow or disallow a reading, I believe 
that, as in other cases we have already seen, there is a perceptual 
principle at work (which is grammatized in Perlmutter's' idiolect but 
not in Zwicky's), namely, (2.22), according to which simpler readings 
are preferred to more complex ones. The greater complexity of the 
structuve resulting ty Extraposition-from-NP in (2.98a) is due not to 
'minimal distance considerations, but to strategy (2.37). As non- 
discontinuity is the extreme case of non-complexity of the intervening 
material, (2.37) correctly predicts that one reading is far less 
complex than the other. 

There is, however, a far more serious problem with (2.99), 
namely, that.it predicts that (2.100) is ill-formed on both potential 
readings • 

(2.100) A woman sent a girl spinning who was pregnant.* 

This is so, because on either reading (2.100) arises through Extra- 
position-from-NP, and thereby becomes identical with the surface 
structure of the other potential reading. In contrast, (2.37) correctly 
predicts that the reading associating the extraposed relative with the 
girl is far more likely than the other (or the only possible one, if 
grammatization has occurred). 

2.2.i|.2. (2.37) predicts that, all other things being equal, a 
continuous reading will be preferred to one involving discon'^inuities 
whenever this situation arises, not only when it is brought about by 
Extraposition-from-NP. Another rule which can create this kind of 
situation is Particle-Movement. (2.101a) shows that the resulting 
discontinuity is possible, with some awkwardness when the intervening 
material is fairly complex, but that .it is extremely unlikely (probably 
out in most dialects) when a continuous reading is possible, with the 
result that (2.101b) cannot in general be read as a paraphrase of 
(2.101c). 

(2.101) a. ?John pushed the little girl' who arrived 

yesterday down. 

b. John pushed the little girl who had fallen down. 

c. John pushed down the little girl who had fallen. 
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It is interesting that the discontinuous reading can be forced 
in (2.101b) by pausing between fallen and down, since the continuous 
reading', although less complex, becomes thereby ungrammatical. This 
cannot be done, however, in (2.98a), since a pause before the relative 
clause could be interpreted as marking the latter as a non-restrictive 
one, and the continuous reading would be possible and preferred. 

2.3. I have attempted in section 2.1 to defend the claim that 
the complexities of erroneous closure, perceptual conflict, and 
interrupted behavior have linguistic relevance. Such perceptual 
limitations will be argued in Chapter Four to be significantly involved 
in Ross^ constraints (as reanalyzed in Chapter Three). 

The most inportant cpnclusion of section 2.2 for what follows is 
that certain remote relations (such as those between discontinuous 
components) are subject to per.ceptual limitations, while others (such 
as antecedent-anaphor relations) are not. This distinction will turn 
out to be crucial in proposing an explanation for the Dichotomous 
Behavior Principle. 



CHAPTER III 



*THE *NUCLEUS-AND-SATTELITE'* CONSTRUCTION 



3.0. The two main purposes of this chapter are to argue that 
complex noun phrases, symmetric and asymmetric coordinate structures, 
and sentences or verb phrases modified, by optional adverbials are 
special^instances of a construction type which I call the ^Nucleus- 

•and-Satellite* construction, and to show that they obey two constraints 
weaker than Ross* Island Constraint (see l.k) (which predicts the 
complete freezing of *A-immediately-over-A* structures). 

3.1. The Hucleus-and-Satellite construction is defined in (3.1) ^ 
and the constraints to which it is subject are stated in (3.2) • 

(3.1) Given three- nodes X, Y, Z, such that 

(i) X and Y belong to the same grammatical 
category, 

(ii) X is properly analyzable as Y, Z,l and X 
immediately dominates both Y and Z, 
(iii) Z is optional, both in underlying and . 
surface structiire, 
V/e call X a Nucleus and Satellite construction 
(N & S) , Y a Nucleus , Z a Satellite 



'he linear order of Y and Z is not relevant 



(3.2) Definition : Given a transformation T and a 
constituent C, 

a. If T moves C, T chops C, and 

b. If T moves an element of C, T maims C. 
With respect to movement transformations, 
N & S*s exhibit the following paradigm: 

(i) The Nucleus cannot be chopped 
(ii) The Nucleus can be maimed 
(iii) The Satellite can be chopped 
(iv) The Satellite cannot be mained 

from under the domination of the N & S node. 

(3.1i-) is the A-over-A condition, while (3. Iii) includes the 
condition of immediate domination which Ross introduced in his Island 
Constraint. (3.1iii) is the condition I have added in order to keep 
out prepositional phrases, usually analyzed as C^^ NPD, complement 
sentences, often represented as C'"5^^ SD, and second coordinate terms. 
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on the assiimption that they are analyzed as or 3 The 
reasons for this added condition will become apparent in a moment* 
(3.1) is an effort to capture the notion of Head and optional 
Dependent . Both the notion of Head and that of optionality of the 
Dependent are hard to define operationally, although there have 
been numerous attempts in that direction (a succint summary of several 
structuralist and transformationalist attempts is given in J. 
Robinson^s "Dependency Structures and Transformational Rules", in 
which she proposes a dependency-grammar solution) • Although there 
is a string intuitive feeling that the head of the construction the 
dog is dor: rather' than the, it is not at all obvious how such a 
claim could be Justified. Robinson proposes that the head is 
always obligatory, while the dependent may or may not be, and 
although this test is as good as any other I am aware of , it is by 
no means a perfect one. The difficulty is that it cannot be applied 
by simply inspecting a given structure, but requires an examination 
of all possible similar structures; some of the obvious difficulties 
are that we can never be sure we have inspected all the pertinent 
structures, we do not have a procedure by which to identify ^corres- 
ponding' elements in similar structures, etc. In fact, Robinson 
regards the head as the * characterizing' element of a construction, 
which is undoubtedly an intuitively satisfactory definition, but not 
one that makes possible a mechanical decision. As I do not have a 
mechanical procedure either, I shall settle for the hope that 
intuitions as to which element is the head of a particular construction 
^vwill in general agree. ^ 



^Bince Robinson's main interest was not to explicate the 
semantic feeling that some element is central to a construction, 
but rather to find a characterizing, necessary terminal element 
which could be identified as the * governor • (in the .sense of G. 
Lakoff (l970a)) of various transformations, she posits heads for 
all constructions , including those that have usually been regarded 
as exocentric. Thus, she has an abs"6ract element T (sentence-Type) 
which constitutes the head of sentences, and she regards the 
preposition as the head oT a prepositional phrase (because it is 
terminal). It is not entirely clear to me how the doubly under- 
scored string in t hat, Jplm will l eave^ his^ y?-^^ is perfectly possible 
would be analyzed in this approach; indeed, *it is not only an S, but 
also a NP, and what the nominal head could be is not easy to see 
(although some abstraction like ^the event* or *the proposition* 
could certainly be posited). In any event, this approach is not of 
interest to our present discussion. Although I have no non- 
intuitive arguments, I do not think it would be easy to make 
informants agree that a preposition is ^modified* by its NP, that 
a complementizer is 'modified* by the following sentence, or that 
a sentence 'modified* its Type, There is no conflict involved 
here, for Robinson makes it clear that she is talking about 
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•characterizing elements', not about 'centers'. I wish, however, 
to keep the notion 'head' for endocentric constructions only. 



-The optionality of constituents is also a notion definable 'in 
general' and with some appeal to intuition, but unfortunately not 
in a mechanical way either. Thus, it is not difficult to assent to 
the proposition that any term of a symmetric coordination is, in 
some, sense,, .^optional, since the other terms can-, in principle, play 
the same role within the containing framework as the entire coordina- 
tion does. The best operational test for optionality would be inter- 
changeability in all contexts, but it is already a commonplace that 
this strong form of optionality is hardly, if ever, encountered. 
Thus , although a coordinate term and the entire coordination are 
often interchangeable, as in (3.3) and (3.^)i counterexamples can 
easily be found, as in (3.5)-(3.7). 

(3.3) a. That Bill murdered John and Mary is awful, 
b. That Biir murdered Mary is awful. 

(3.^) a. Either Bill or Mary will have to leave, 
b. Bill will have to leave. 

(3.5) a. John and Mary went their separate ways, 
b. *John went his separate way. 

(3.6) a. John, Maiy and Bill are my three best friends, 
b. *Jqhn and Bill are my three best friends. 

(3.7) a. Bill, Sally and the, merchant divided the 
^ money among themselves. 

b. *Bill and the merchant divided the money among 
themselves • 



Moreover, interchangeability in context would be a necessary^ 
but not a sufficient, condition for a mechanical decision; thus, 
although the singly- and the doubly-underscored material in (3.8) 
are interchangeable with preservation of grammaticality , it seems 
counterintuitive to regard the singly-underscored material minus the 
doubly underscored one as optional. 

(3.8) The man who talked to the lady with a JrlSJ^^Xy - ^O F^ 
' in her arms is coming towards you. 

It appears that both the notion of head and that of optionality 
of constituents are theoretically ill -defined at present, although 
both are intuitively clear in a large number of cases. I shall use 
both these notions- in what follows, because I believe that interesting 
generalizations hinge on them, without however pursuing the task of 
defining them formally any further. I have coined the terms Nucleus 
and Satellite because, as I shall argue below, coordinate constructions 
satisfy both the structural definition in (3.1) and the behavioral 
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prediction in (3.2), and ceLLling a coordination a head-and-dependent 
construction might appear as a more startling claim than it really 
is. 

The behavior of N & S*s with respect to movement transformations 
as predicted by (3.2). is clearly less rigid than the prediction rnade 
by the IC. Indeed, the latter predicts that neither Nuclei nor 
Satellites are either choppable or maimable, while (3,2) predicts 
that Nuclei can be maimed but not chopped, while Satellites can be 
chopped but not maimed. 3 



3(3.2ii) and (3.2iii) do not, however, make the claim that 
each language will have rules capable of maiming every single 
instance of Nucleus and/or of chopping every single instance of 
Satellite; the claim is merely that there is no generar constraint 
blocking all movements out of N & S's, as the IC predicts. 



It can be seen already why condition (3.1iii) was imposed; 
indeed, prepositional phrases, complement sentences, second coordinate 
terms, and in general, constituents introduced by oblif^atory 
connectors , do not behave as predicted by (3.2). Indeed, the NF 
of a prepositional phrase can be freely moved in English', thus 
failing to comply with (3.2i), and the connector can never be moved, 
while Satellites in general can (as predicted by (3.2iii)). I 
suspect that the freezing of juch connectors is a very strong, probably 
universal, constraint; although my knowledge of languages is quite 
limited, I would be very surprised if a. rule moving connectors 
(whether these precede or follow- the elements they introduce) turned 
out to exist in some languege. 

The problem of proposing an explanatorily adequate account of 
the conjunction of (3.1) and (3.2) will be taken up in Chapter Four, 
but it can be shown rather. easily at this point that my proposals 
escape the objections I raised against the IC in (I.II8). I show 
this in (3.9) below: 



(3.9) a. In simple Extraposition, the extraposed B is 
maimable, although immediately dominated 
by the category S, because (3.1iii) is 
iiot satisfied. 

b. Extraposition from NP is permitted by (3.2iii). 

c. Extraposition-of-PP is permitted by (3.2iii). 

d. Prepositions can be stranded because (3.1iii) 

is not fulfilled, 
e* An NP complementing another NP can be chopped, 
because of (3.2iii). 

f . Complement sentences can be maimed because 

(3.1iii) is not fulfilled. 

g. The maiming of matrix S*s modified by 

adverbial clauses is permitted by (3.2ii)* 

h. The chopping of adverbials is allowed by 

(3.2iii). 
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(3.9) i. Adverb Preposing is allowed by (3.2iii). 

J. The behavior of assymetric coordination will 
be shown to be entiifely consistent with 
the conjunction (3.1), (3.2). 
k. A complete answer is not 'possible at this 
point, but it wi31 later be shown that 
across-the-board chopp .ng is not an 
instance in which a Satellite is maimed 
'from under the domination of the H & S 
node' . 



The IC has thus been replaced by two weaker: constraints, 
(3.2i) and (3.2iv). I shall now show that these two constraints 
are in fact largely independent, both with respect to the structures, 
and tb the kinds of phenomena, to which they are applicable. 

As far as structural applicability is concerned, (3.2iv) alone 
is restricted to Satellites, while (3*2i) is applicable to heads 
in general. Thus, if we accept the currently held views (which, as 
I said, 'I cannot prove ) that the noun is the head of the NP and the 
verb is the head of the VP, there are hardly any rules which move . 
Just the constituent N -or V. The fact that NP's can move out of 
PP's leaving a stranded preposition is not a counterexample, since, 
as I pointed out before, I ^consider the notion 'head? applicable 
to endocendric constructions only (this is precisely the intuition 
which has led previous writers to posit the endocentric/excentric 
dichotomy), unlike Robinson, who posits heads in exocentric 
conf.tructions like PP as well. On the other hand, (3.2iv) cannot 
be e:d;ended to modifiers in general, for, if VP is .regarded as an 
endoc-^ntric construction with the V'as head, a complement of V con 
57iost c^>rtainly be maimed. 

Vith regard to the sets of phenomena which obey (3.2i) and 
(3.2iv), neither of which is restricted to chopping, it is inter- 
esting tiiat they do not coincide. The phenomena which obey (3.2iv) 
have been discussed in Chapter One, as they constitute one member of 
the partition determined by the Dichotomous Behavior Principle. 
Which phenomena obey (3.2i) has not been studied in detiiil, but it 
is sufficient to show that one kind of phenomena obey, one constraint 
and disobey the other in order to show that these constraints 
determine different partitions of syntactic processes. (3.10a) and 
(3.10b) show that any neutralization 'into* complex lIP's is 
constrained by (3.2iv), but not by (3.2i). 

(3.10) a. never heard of the claim that John ever 
killed anybody, 
b. I never knew anybody who claimed that Bill 
was unfriendly. 

There may well be other rules which behave differently with respect 
to the two constraints, but the facts are not entirely clear to me. 
Thus, it would appear that copying rules, which we recall are 
exempt from (3.2iv), are subject to (3.2i), We exemplify this 
property in (3-11) with the rule of Relativization in Hebrew. 
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(3.11) a. Raita et haieled / \ she kvar holex lebeit- 

sefer. 

•You saw the child ( j vho already goes tc 

"school . * 
Haieled she raita oto (*/ \ she kvar holex 
^ lebeit-sefer ( j) xaxam.' 
•The child who you saw him (*( j who already 

goes to school ^ j) is cleve]?. 

Referential pronouns., of the kind that result from copying, 
cannot in general be heads of relative clauses-. The point is 
illustrated with respect to Hebrew and English in (3.12a) and (3.12b) 
respectively (the latter is a transla^jion of the former), 

(3.12) a. Haiti et haialda^, veata gam raita ota^ 

(*( J she kvar holexet lebeit-sefer) . 
b. I saw the girl^, and you also saw her^ 
(*( J who already goes to school). 

It is not entirely clear if (and/or to what extent) (3.2i) is 
responsible for the unacceptability of (3.12). On the restrictive 
reading of the relative clause, the head does not refer independently 
of the N & S which it heads, and thus cannot be coreferential with 
anything. But it is not ovious why the non-restrictive reading of 
the relative clause should be bad (at leafet when the latter is a 
comment of the speaker, since some speakers accept (3.12b) with the 
relative expressing an after-thought; however, on the after-thought 
interpretation, there is no way of knowing, whether the clause 
modifit ; the girl or her!). A possible reason for the unacceptability 
of (3.12a) and (3.12b) may be that it is odd to make a comment on an 
NP whose referent is not identified uniquely, and it seems to make 
more sense to attach a comment to the more fully specified antecedent, 
which has a better chance of achieving identification of the pertinent 
referent. This assumption is supported by the fact that (3.12c) is 
bad in general, but good if spoken with heavy stress on he and in 
the presence of the referent of he 

(3.12) c. He, who is my best friend, is very sick. 

In the latter case, he is in fact a deictic, and deictics are 
perfectly all right with non-restrictive relatives; notice that 
deictics identify uniquely (I will argue below that precisely for 
that Reason deictics cannot take restrictive relatives), and thus 
support my point. 'All this suggests that the badness of (3.12a) and 
(3.12b) may not be due to (3.2i) on either the restrictive or non- 
restrictive reading. It should also be noticed that reflexives 
function similarly; 

(3.12) d. 'John cut himself (*( J who is my best friend)). 

(3.12d) is tolerable \dth the relative as an after-thought, but as 
in the case of (3.12b), it cannot be shown that the after-thought 
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necessarily modifies the reflexive rather than the antecedent. 

One additional reason Sot doubting t>ict pronominalization is 
subject to (3.2i) is that non -referential pronouns can be heads of 
restrictive relatives, as (3 -IS) shows, 

(3 •is) I sav a Pretty girl, and you savf one vh© wasn't 
too attractive i 

The fact that (3.13) is acceptable ties in nicely with my hypothesis 
that referer.tiality was responsible for the unacceptability of 
(3.12b) on the restrictive reading.^ It should also be noticed that 



lYonotins having the surface shape of referential pronouns 
like the .one in (3.12b) can be used nonreferentially, in which case 
restrictive relatives are perfectly possible, as in (i): 

(i) He who commits murder shall be severely punished. , 



(3. 13) is bad if one is tinstressed and the relative is non- 
restrictive; this also fits my hypothesis concerning the unaccept- 
ability of (3.12)' on the non-restrictive reading, since -unstressed 
one is non-specific, and therefore does not identify a referent. On 
the other hand, (3.13) is possible with a non-restrictive reading if 
one is stressed, for one is then specific and does identify a 
referent . 

HP's modified by complement sentences behave essentially in 
the same way as NP's modified by relative clauses, that .Is, they 
cannot be referential pronouns, but can be non-referential ones, as 
(3.IU) shows. 

(S.l**) a. *I heard the hypothesis that Bill has cancer 

and you heard it that Mary has ^jneumonia. 
b. I heard a prognosis that Bill has cancer and 

you heard one that Mary has pneumonia. 5 
c# *I am aware of the fact that Bill is sick, but 

you are not aware of one that Mary is sick 

as well. 



^(S.l'^b) is good only if one is stressed (and, therefore, 
specific). Thus, complement sentences function like non-restrictive 
relatives, rather than like restrictive ones, which is consistent* 
with the observation that complement sentences supply additional 
information on the head NP, but do not provide narrower identification 



The tingrammaticality of (3.lUc) is due to independent reasons; 
indeed, one is indefinite, and the structure *you are aware of a 
fact that Mary is sick is also ill-formed. In general, pronominal- 
ization with one in such cases is bad Jxxst in those cases in which 
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the corresponding full HP is also bad (the constraints on the 
determiners of HP's with complement sentences are discussed at the 
end of section 3.2.3). 

Concerning copying rxiles, I wish to point out that if they are 
made subject to (3.2i)» it becomes possible to explain a baffling 
fact about the CSC in languages with copying rules. Thus» an 
element of a coordinate term can be copied^ but an entire coordinate 
term cannot. The situation is illustrated in (3.15a) and (3.15b) ^ 
with examples from Hebrev (R. Wojcik informs we that the same 
situation is found in Breton), 

(3.15) a, Haieledji^ she raita et ima shelo^ ve et aba 

shel xalm iatsa. 
•The childi who you say his^ mother and 

Chaim's .father has left, 
b. *Kaieled£ she raita oto^ ve ec xaim iatsa. 
'The childi who you saw him^ and Chaim has 

left. 

If » as I shall argue later ^ coordinate structures are H i S's^ the 
paradigm in (3.15a) and (3.15b) is accounted for (3.2iv) and 
(3.2i) respectively. 

The behavior of copied pronouns differs from that of ordinary 
referential pronouns, as the perfect acceptability of (3.l6), ir* " 
contrast to the unacceptabiiity of (3.15b), shows. 

(3.16) Haieled -babait, ve raiti et xaim ve oto korim 

sefarim. 

'The child is at home, and I saw Chaim and him 
reading books. 

The difference between ordinary pronouns and the ones that result 
from copying is that the latter are subject to coreferentiality 
neutralization. Thus, when such a situation arises in Englii?h (even 
by invited inx'erence, as argued In Chapter One) the result is far 
from good, although less bad that? when the pronoun is removed 
completely, as shown in (3.17). 

• (3.17) a. ?*The rock is too heavy for me to try to 
. , /"it and the hammer^ 
pick Ij^Y^^ hammer and it J ^P* 
b. **The rock is too heavy for me to try to 
rs<^\r /^Md the hanmerT „^ 
P^^^ tjthe hammer andj 

Although it is hardly possible to compare degrees of acceptability 
across languages, there is indirect evidence that structures like 
(3.15b) in Hebrew are less unacceptable than structures in which a 
coordinate term has been removed; in other words, the unacceptabiiity 
of (3.15b) is like (3.17a), rather than like (3.17b). The reason 
for believing this is that structures like {3.15b) become acceptable 
under certain conditions, one of which is topicalization, as shown 
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in (3-18)* 

(3.18) Haieled she^oto ve et xaim her ?eti leha iatsa 

min habait. 

•The child that him and Chaim I showed you has 
left the house* . 

Although I am at a loss to explain why topicalization should Improve 
acceptability in such cases, it is reasonable to assume that copying 
a head is less bad than chopping it. This is confirmed by evidence 
internal to Hebrew; question formation is a chopping rule, aiid we 
see that (3.19) is out, despite topicalization. 

(3.19) *Mi ata xoshev she ve et xaim her?eti le imi? 

•Who do you think that and Chaim I showed 
my mother?* 

The contrast in acceptability between (3.15b) and (3.l6) reduces 
the likelihood that the unacceptabllity of (3.12a) or (3.12h) have 
much to do with (3.2i). On the other hand, the fact that reflexives 
and ordinary definite pronouns are both bad as heads of relative 
clauses (as shown by (3.12d) and (3.12b) respectively) does not mean 
that ^eflexivization is not subject to (3.2i), for (3.12d) could be 
out both by (3.2i) and by the semantic constraints on ordinary 
pronominalization discussed above. If Reflexivization is made 
subject to (3.2i) but not to (3.2iv), it becomes possible to explain 
the contrast in acceptability between (l.92a) and (l;9l*). The 
following pair suggests even mpre strongly that Reflexivization. 
obeys (3.2i) rather than the much stronger CSC, as Ross thought." 



^If ordinary pronominalization and feature-changing rulesf are 
not subject to (3.2i), it is rather startling that Reflexivization, 
which is both a pronominalization and a feature-changing rule, be 
subject to it; nevertheless, the facts seem to suggest it. 

In Chapter Four, where I suggest a natural explanation for 
(3.2iv) and (3.2i), I argue that there is a unique principle which 
determines which phenomena are subject to the former. With respect 
to the lattar, I h&ve doubts that Reflexivization and chopping rules 
obey it for the same reason, as' will become clearer in ^.5» where 
the explanation for chopping rules being subject to (3.2i) is 
offered. 



(3.20) a. ^^John hit 




and himself. 



b. John put a knife and a. picture of himself 
on the table'. 

With respect to the copying of sentences modified by adverbials, 
I have been unable to find examples in Hebrew in which a- S is 
copied. An illustration woxad be a sentence like ^ That Bill is 
crazy, mother believes it , if it existed. Notice, however, that the 
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pronouns that woiild occur from such copying would be different 
from both referential pronouns and non-referential ones like one, ' 
since they would involve identity-of-sense anaphora. Although no 
identity-of-sense copying is foimd in English, it can be shown that 
such phenomena are constrained by (3 -21) quite independently of 
copying: 

(3.21) a. John resents Mary's going around naked, but 
Bill does not resent Mary's .going around 
naked for pixrposes of publicity, 
b. *John resents Mary's going around naked, but 
' Bill does not resent it for purposes of 
publicity 

Cwhere for purposes of publicity modifies 
rt, not re sent] . 

Notice that there is a form of identityn^f-sense- pronominalization, 
which is to onej-pronominalization what it in (3.21b) is to 
referential pronouns, as (3.21c) shows.' Presumably, it^ in identity- 
of-sense phenomena implies total identity (as referential pronouns 
do), unlike pro-forms like one and the underscored constituent in 
(3.21c). 

(3.21) c. John resents Mary's going around snaked, but 
Bill does not resent such behavior for 
purposes of publicity. 

In conclusion, it is not clear to me to vhat.^extent the copying 
and pronomihalization of Nuclei is constrained by (3.2i) and by 
independent (semantic) constraints on pronominaiization. Whatever 
1>he situation, it is clear that (3.2i) and (3.2iv) differ with 
respect to the phenomena they affect, as was earlier shown with 
feature-changing rules. 

In this section, we have discarded the notion of A-over-A 
configuration (defined by (3.1i) and (3.1ii)) in favor of the notion 
of N & S (defined by (3.1i), (3.1ii), and (3.1iii)) and of head 
(undefined but assumed to be understood intuitively). We have also 
weakened the prediction that the structures thus defined are totally 
impervious to certain syntactic operations by introducing the more 
limited constraints (3.2i) and (3-2iv). 

The remainder of this chapter, that is, the .sections 3.2, 3.3, 
and 3.i*, is devoted to complex NP*s, optional adverbials modifying 
S's or VP's, and coordinate structures respecitvely. Each section is 
broken down into subsections corresponding to the basic subtypes found 
for each construction type. In each subsection, it is argued that 
the construction under discussion satisfies the requirements in (3.1) 
(and therefore is a legitimate N i s), and that it behaves, with 
respect to movement transformations, as predicted by (3.2i)-(3.2iv) . 

3.2. -In this section, which is devoted to complex NP's, I 
discuss non-restrictive relatives, restrictive relatives, and nominal 
complements, in that order* 
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3.2.1. We begin by considering non-restrictive relatives, as 
in (3.22). 

(3.22) A girl, vho you knov . just left* >— - 

3.2.1.1. I shall first argue that- the underlined string in 

(3.22) satisfied the first pcurt of (3.1ii), that is, that it is 
properly analyzable into two constituents, for it is_'conceivable 
that it might have the structure DET N S. The tests for showing 
that the string DET N forms a constituent in this case are not 
numerous; indeed, the usual way of proving that a string is a 
constituent is by s:howing that it can be moyed by various syntactic 
processes, but the constituent at issue is frozen by (3.2i). Tests 
can nevertheles be found; I offer three below. 

The first test concerns the output of Coordination Reduction 
when this rule applies to the right. Indeed, when the output 
contains a coordination of non-constituents, the latter are obliga- 
torily separated by pause, as we can see in the contrast between 

(3.23) and (3.21;). 

(3.23) a. John and Mary ate the cake, 
b. John, and Mary, ate the cake. 

(3.21;) a. John baked, and Mary ate, the cake, 
b. *John batked and Mary ate the cake. 

Consider now (3.25). If the underscored strings in (3.25a) were 
not constituents, we would expect (3.25b) to be ungrairanatical , 
like (3.2l;b). The fact that it is grammatical suggests that (3.25b), 
like (3.23a), contains a conjunction of constituents. 

(3.25) a. The boy , who wants to go to Paris, ar.'j the 

girl , who wants to go to Paris, are in 
love with one another, 
b. The boy and the girl, who want to go to 
Paris, are in love with one another. 

The second test concerns a. 'hierarchical constraint* on 
identity deletion in coordinate structures proposed in J. K. Y. 
Tai (1971): 

(3.26) Higher identical constituents must be deleted 

before lower identical constituents. - 

On the assumption that the underscored strings in (3.25a) are 
constituents in construction with the following relative clauses, 
(3.26) con explain the grammaticality of (3.27a) and the unfrrammati- 
cality of (3.27b). 

(3.27) a. The boy and girl, who want to go to 

Paris, are in love with one another. 
b» *The boy, who wants to r.o to Paris, and girl, 
who wants to go to Parin, are in love 
with one another. 
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Indeed, (3.27a) can be derived from (3.25a) through the intermediate 
stage (3.25b), while (3.27b) is derived directly fVom (3.25a); The 
contrast in grammaticality can be accounted for by (3.26) if the 
conjuncts of (3.25a) are analyzed as cg s: , but not if they are 
analyzed as Cg^T jj s: or cg^T q^I. the constraint on Coordination 
Reduction is apparently stronger than predicted by (3.26), for 
lower order constituents cannot be reduced to the left even if they 
are the only identical ones, as shown in (3.28). For reasons I do 
not understand, the reduction of lower constituents to the right 
exhibits a gradual decrease in acceptability, rather than abrupt 
ungrairanaticality, as shown In (3.29). 

(3.28) ta. The boy, who wants to see Paris, and the 

girl, who intends to visit London, are 
. in love with one another, 
b. -The boy, who wants to see Paris, and 

girl, who intends to visit London, are 
in love with one another. 

(3.29) a. Mary" baked, and John ate, the cake. 

b. ?I claim that Mary baked, and you think 

that John ate, the ccJce. 

c. ?*I claim that wy sister believes that Mary 

baked, and you think that yo\xromother 
imagines that John ate, the cake. 

. The third test is of more limited interest, since it shows 
that structures like (3.22) cannot have the structure C^^ S::, 

but it does not show that they cannot have the structurS^ ^ 
tgp^ N SD. I shall present it nevertheless ^ , .since the elimination 
of even one alternative is worth considering. 

The heads of genitives can be deleted under certain cond^ions 
which need not concern us here. Thus, (3.29a.) can become (3.30b). 

(3. 30) a. I know Jill^s boy-friend, but not Mary's 

boy-friend. 

b. I know Jill's boyfriend, but not Mcury's. 

If constructions like (3.22) had the internal structure DET SD, 
we would expect (3.31b). to be derivable from (3.31a) , and 
(3.32b) not to be derivable from (3.32a). Both expectations are 
thwarted . 

(3.31) a. I know Jill^s boy-friend, who is from 

Columbus, and you know Mary's boy-friend, 
who is from Columbus . 
^ b. I know Jill 's*- boy-friend, who is from 
Columbus, and you know Mary's. 

(3.32) a. I know Mary's boy-friend, who is a doctor, 

and you know Mary's boy-friend, who is 
an engineer. 
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(3.32) b. I know Mary*s boy-friend, who is a doctor, 

and you know Meiry^s, who is an engineer. 

Having shown that structures like (3^22) satisfy the first part of 
(3..1ii) we have in fact shown that they satisfy the second part of 
(3.1ii). True, the preceding arguments would be consistent with an 
analysis of (3.22) such as in (3.33), but we can avoid the latter by 
imposing the condition (3.3^) which, as far as I am aware of, has 
been assumed by all writers in transformational f^rammar. 

(3.33) NP 




a girl, who you know. 



(3.3V) Never posit more structure than you have evidence 
for. 

V/ith respect to (3.1i), notice that there is an implicational 
relation between it and (S.liii). Thus, the latter implies the 
former, although the opposite is not true, as shown by prepositional 
phrases, sentences in construction with complementizers, etc. There- 
fore, if we can show that (3.1iii) holds, we no longer need to show 
that (3.1i) holds^ I believe that non-restrictive relative clauses 
are among the clearest^ cases of optional constituents, although, as 
I pointed out earlier, I know of no operational tests for Dfovinp; 
optionality. Thus, interchangeability in all contexts with preserva- 
tion of grammaticality would be a good test (if appropriately 
constrained to avoid coxmterexeunples like (3.8)), but it would be 
inapplicable in any event, since we could never look at all contexts ; 
besides, the requirement could presumably be shown not to hold even 
for non-restrictive relatives, by cases like (3.35) • 

(3.35) Mary, who sleeps with anybody who asks her, 

is planning a trip to Europe, and believe 
it or not, her mother doesn't even know 
that she is that kind of girl. 
b.??Mary is planning a trip to Europe, and believe 
it or not, her mother doesn't even know that 
she is that kind of girl. 

The relevance of optionality to the problem at issue is discussed 
at greater length in Appendix Two. 

3.2.1.2. We now turn to the relation between structures like 
(3.22) and (3.2). (3.2i) is illustrated in (3.36). 
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(3.36) a. I told John, who is sick, that he cannot 

go on living here, 
b. *It*s John that I told, who is sick, that 
he cannot go on living here. 

(3.2ii) appears not to be realized in English, since there are 
no rules which maim the configuration CDET H3. There is no violation 
involved, however, for (3.2i), as I have pointed out already, 
applies not just to Nuclei, but to Heads in general. Therefore, the 
constituent N is blocked by. (3.2i); the constituent DET is blocked 
by the Left Branch Condition in English, but apparently not in 
Latin or Russian .> 

(3.2iii) is illustrated by Extraposition -from-NP, although the 
latter is not accepted with non-restrictive relatives^Jji^all dialects 
Thus, (3.37) is acceptable to some speakers and^ unacceptalDle to 
others . 

(3.37) a. A girl, who I couldn^t care less about, has 

just left. 

b. A girl has just left, who I couldn*t care 
less about.. 

(3.2iv) is illustrated by (3.38). 

(3-38) a. I know a girl, who is greatly interested in 
Bill. 

b. *Who do L know a girl, who is greatly 
interested in? 

3.2.2.1. The three tests involving Conjunction Reduction that 
we used in arguing that non-restrictive relative constructions 
satisfy (3.1ii) are mechanically applicable to 'restrictive 
constructions. The same examples (3.25), (3.27), (3.28) and 
(3.3l)-(3.32) with the appropriate commas removed can be used to 
support the corresponding arguments. 

With regard to requirements (S.li) and (3.1iii), they are also 
satisfied in most instances. There are, hqwever, a few apparently 
embarrassing cases, which I discuss below. 

Thus, nouns like tinie, place , manner seem to require rertrictive 
relatives when they are preceded by certain determiners, and 
particularly when they' form optional adverbials. Relative clauses 
are apparently required by a, the , every , optional with some, and 
usually impossible with that . However, it would be rather unnatural 
to claim that the syntactic representation of these few cases is 
radically different from that of other restrictive relative 
constructions. First, notice that the optional! ty of the relative 
clause seems to depend upon the optionality of the higher order 
construction which it forms with its head. Thus, while the 
sentences in (3.39) are always bad, the sentence in (3.^0) are 
acceptable, if some previous context is furnished. 
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(3.39) a. *He arrived at a tine . 

"b. *Ke killed John at a -place ♦ 
c. *He slapped Midf^e in a manner * 

(3.J*0) a. Ke T)roposed a time. 

b. He vent to a place. 

c. He decided on a manner. 

The most interesting pair is (3. 39b)-(3.1*Ob) , since both are 
adverbials distinjp;uished only by optionality. Thus^ the counter- 
argument that one may raise to the effect that a time and a manner 
can be nronominalized with it^ in (3.1*0) but not in (3.39) is 
irrelevant here, since neither* (3.39b) nor (3.1*pb) can be nronominalized 
with it. But if the pertinent sequences have the ?;?une structure in 
(3.39y"and in (3.1*0), it can be shown that they are all llP^s (i.e., 
that (3.1i) is satisfied), since they can.be con^loined with other NP's 
as in (3.1*1). 

(3.1jl) a.. Hp mentioned the time and the verson that T 
was interested in. 
b. He discussed the woman and the place that I 
had found out about. 
^ c . I told him about the princir)le and the manner 
that he was curious about. 

I wish to sufjgest that the sentences in (3.39) are out not because 
they are semantically ill-formed, but because, they are semantically 
odd, or, more specifically, because they violate an accented 
conversational principle like (2.23), which T repeat below for 
convenience: 

(2.23) V/hen something is done for a purpose, and 

that purpose cannot be achieved in principle, 
perceptual conflict arises. 

Indeed, the adverbials in (3.39) are optional in surface structure, 
since it can be inferred that when someone does something, he does that 
at some time, at some place, and in some meinner. Therefore, if one 
goes to the trouble of mentioning the adverbials explicitly, the 
assumption is created that these will carry additional information, 
and this turns out not to be the case. Notice that the presence of 
completely uninformative optional constituents is hip:hly unacceptable 
in other cases as well, in fact, to such an extent that it is difficu3.t 
to tell whether the unacceptability of (3.1*2) has been frammatizod or 
not (the offending constituents are underscored). 

(3.U2) a. ?*The car was given away to a human being . 

b. ?*The president has been murdered by a human 
being . 
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Notice that the sentences in (3»39)* end (3.1*2) remain bad if 
a is replaced by the in the underscored portions. The reason is 
basically the same, I think, since the is referential in non-opaque 
contexts like the above, but fails to identify uniquely if unqualified 
by a relative clause; therefore, the above sentences with the in the 
appropriate places fail to fulfill their purported purpose, and are out 
too, as predicted by (2.23). 

The problem with every seems to be a little different. Thus, 
every seems to be bad not Just in sentences like (3.39), but also in 
sentences like (3.1*0), as "shown by (3.1*3) and (3.1*1*) respectively. 
The reason is, I suggest, the implausibility of the assertion made. 

(3.1*3) a. *He visits her at every time. 

b. *He kissed liary in every place. 

c. *He tortured her in every manner.. 

(3.1*1*) a. *He mentioned every time. 

b. *He went to every place. 

c. *He discussed every manner. 

An appcurently idiosyncratic fact for which I have no explanation is ' 
that all is not subject to the same restrictions as every .* Thus, he 
visits her at all times is acceptable, unlike (1*^1* 3a), because the 
former can be understood with some qualification, such as "he visits 
her at all c onceivable times", or "he does so at all times when he 
is available* , etc. 

The behavior of that is fairly clear. Thus, apart from t«:rtain 
constructions like he is looking for that which nobody has ye t 
discovered , that is referential, and if so, it identifies uniquely . 
Therefore, the addition of a restrictive relative, whose purpose is 
precisely to achieve unique identification, appears as a superfluous 
endeavor and. is ruled out by (2.23). That the unacceptability of 
referential that with restrictive relatives, ^ in (3.1*5), is not an 
idiosyncratic property of that but rather follows from some general 
principle like (2.23) is shown by the fact that , restrictive relatives 
are unacceptable in two other cases in which that is not found, but 
uniqueness of reference is achieved independently of the relative 
clause. The first case involves the oddity of constructions consisting 
of a head HP and a restrictive clause which in turn form the head of 
construction of the same kind. The second case concerns the unaccept- 
ability of restrictive, relatives modifying NP*s vhose referents are' 
empirically known to be unique. These two unacceptable constructions 
are illustrated in (3.1*6) and (3.1*7) respectively. 

(3.1*5) *I showed John that house which is white. 

(3.1*b) *CCThis is the boy who is tall] who you saw 
yesterday]. 

(3.1*7) *I showed Midge the sun which burns fiercely. 
(3.1*5) is unacceptable because both the deictic properties of that and 
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the relative clause ccipled with a definite head NP can identify 
uniquely, and the fact thr.t 'both devices have been used suggests 
that one of them cannot identify uniquely^ which conflicts with the 
presupposition of uniqueness associated with either. {3.k6) is bad 
because the second relative clause also suggests that its NP head can 
be more narrowly specified, and this again contradicts the pre- 
supposition of the first relative clause. Finally, in (3.1*7) » even 
though there is no linguistic feature about the sun requiring that 
it have a unique referent, it is an empirical assumption that there 
is only one sun, and the restrictive relative is just as bad as in 
(3.1*5) or (3.1*6). r 

It remains to explain the behavior of some, since it is not 
obvious why the sentences in (3.39) shoxild be out, while those in 
(3.1*8j should be acceptable. 

(3.1*8) a. He arrived at some time. 

b. He slayed Mary at some place. 

c. He stole the money in some manner. 

To begin with, notice that the sentences in (3.1*8) are acceptable 
only if some is stressed. If some is unstressed, a sentence like 
(3.1*8a) can only be used as an evaded answer to when did he arrive? 
If some is stressed, however, sentences like those in T3.1*8) can be 
used when the speaker does not wish to reveal certain facts to the 
hearer, or when he does not wish the facts in question to be over- 
heard by a third party and, perhaps, hopes that the hearer can guess 
what he has in mind, etc. The difference in acceptability between 
(3.39) and (3.^48) is that some , unlike a^, constitutes an explicit 
signal to the effect that some information will not be revealed, and 
therefore (3,1*8) is not ruled out by (2.23), since there is no longer 
any reason to believe that the adverbial is used for the purpose of 
adding information. 

3.2.2.2. The applicability of (3.2i) is illustrated by (3.1*9): 

(3.1*9) a. I showed the boy who was from Chicago to 
the girl with a red hat. 
b. *It's the boy I showed who was from Chicago 
to the girl with a red hat. 

(3.2it) is inapplicable in English, but this is not a counter- 
example; the reasons are the same as those given in the discussion of 
non-restricted relatives. 

(3.2iii) is illustrated by Extraposition-from-IIP, as shown in 
(3.50): 

(3.50) a. A man who was drunk walked in. ' 
b. A man walked in who was drunk. 

The applicability of (3.2iv) is shown in (3.51); 

(3.51) a. I showed you a boy who likes bananas, 
b. *What did I show you a boy who likes? 
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3.2.3.1. While ray earlier claim that relative clauses vith 
heads have the surface structure representation SD is, as far as 
I know, the view held by most writers in transformational grammar, it 
has been proposed by a number of authors (Chomsky (1970), (1971) in 
particular) that, nouns with complement sentences have the surface 
structure representation CgfT 533^ y^ile the motivations for 
positing such a structure are fairly clear (the desire to express 
the parellism between John^s claim that Mary is dead and John claimed 
that Mary was dead), there is clear syntactic evidence against positing 
the surface structure that Chomsky defend£. 

To begin with, the three tests that I used in arguing that 
relative clauses are properly analyzable into two constituents yield 
the same results fo'r nouns with complement sentences. 

First, (3.52) exactly parallels (3.25), as (3.52b) is grammatical 
even though there is no pause between the underscored conjuncts (as 
we recall, non-constituents cannot be coordinated without intervening 
pause ) . .0 

(3.52) a. The suspicion that John may be a spy and the 

claim that John may be a spy vere both 
discussed by the FBI investigators. 
^» The suspicion and the claim that John may be 
a spy were both discussed by the FBI 
investigators. 
J . 

Contrast also (3.53a)^and (3.53b): 

(3.53) a. John believes, and Mary claims, that Ike is 

stupid. 

b. John's belief and Mary's .claim that Ike is 
stupid are odd. 

Notice that the second conjunct must be flanked by pauses in (3.53a) 
but afife in (3.53b), \fhich suggests that only the conjuncts In the 
former are non-constituents. 

Second, if the determiner were indeed ^^the subject*^ of the NP, 
as Chomsky claims, then we would expect (3.5l*b) to be grammatical. 
This is so, because (3.1*5a), a clear case of determiners in construction 
with the constituent N, does indeed reduc^-j to (3.55b). hioreover', the 
grammaticality of (3.56b) in conjunction with the tmgrammaticality of 
(3.5^b) argues against Chomsky's claim that NP's and S's have the 
same internal structure except^ for node-labels. * 

(3.51*) a. 'Ij^^'^^' claim that Bill was a spy and 

"[the^ ^ insinuation that Mary had 



recruited him came as a big surprise. 
^* *|^he^ ^ claim that Bill was a spy and 



insinuation that Mary had recruited him 
came as a big surprise. 
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(3.55) a. John^s ideas and John's beliefs are shocking^ 
b. John's ideas and beliefs are shocking. 

(3.56) a. John claimed that the boys were dead and 

John insinuated that Mary had murdered them, 
b. John claimed that the boys were dead and 
insinuated that Mary had murdered them. 

Third, recal.. the rule which deletes the constituent N in 
construction with a genitive detexTiiiner . If Chomsky's analysis were 
correct, we woiad expect (3.58a) to reduce to (3*58b) in the same way 
in which (3.57a) reduces to (3*57b). However, (3.58a) and (3.58b) 
are not paraphrases, since the missing constituent in the latter can 
only be understood {if at all) as claim but not as claim that Mary 
deceived Bill. 



(3.57) a. I like John's ideas, but not Mary's ideas, 
b. 1 like John's ideas, but not Mary's. - 

(3.58) a. I h*ard of John's ^claim that Mary deceived 

Bill, but not of Jack's claim that Mary 
deceived Bill. 
b.??I heard of John's claim that Mary deceived 
Bill, but not of Jack's. 

TiST *he construction at issue had indeed the structure 

Cfjp Cjj S3'D, that is, if the DET were indeed in construction with a 
larger N consisting of N and S rather than with the smaller N, we 
would predict not only that (3.58a) reduces to (3.58b), but also that 
(3.59a) does not reduce to (3.59b). As can be verified, both 
predictions are incorrect. 

(3.59) a. I heard John's claim that women are eqxxal to 

men but not Mary's claim that men aire 
inferior to women, 
.b. I heard John's claim that women are equal to 
men but not Mary's that men etre inferior to 
women. 

A fourth xmdesirable consequence of accepting Chomsky's analysis 
is that the extraposition of relative clauses end of sentences in 
apposition could no longer be stated as one rule; instead, we would 
have to formulate a rule of Extraposition-ffom-N, in addition to 
Extraposition-from-KP. This is so because it is necessary to specify 
the category of the constituent that the extraposable clause is in 
construction with, or we would predict extraposition in cases like (3.6o) 

(3.60) a. Although I hate the idea> that Bill will have 

to go to prison is something we will have to 
face, sooner or later, 
b. As I once told a little boy» who steals my 
purse steals trash. 
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We have thus seen four reasons for believing that constructions 
containing sentences in apposition are properly analyzable into two 
constituents from which we may conclude that they satisfy (3,lii). 
We now attempt to show that such constructions satisfy (S.li) by 
showing that they satisfy (3.1iii). 

As far as I am aware, NP's which allow sentences in apposition 
do not require them. Thus, a hypothesis , the hypothesis , that ' 
hypothesis and Billys hypothesis can all occur as independent IJP's 
without modifying clauses. True, (3.6la) is preferable to (3.6lb), 
but the latter seems to me no worse than (3.6lc). (3C6nb) and {3.6lc) 
are strange without sufficient context, because the implies a known 
referent, but does not make unique identification possible. 

(3.61) a. I heard the claim that the earth is round, 
b. I heard the claim. 

c I spoke to the boy. 

On the other hand, some of the abstract nouns which allow 
sentences in apposition disallow certain determiners, or disallow 
sentences in apposition with certain determiners.- Thus, we have 
neither ^Bill^s fact nor ^ill^s fact that Mary is a thief (for 
semantic reasons, presumably since it is hard to see what such 
constructions can mean). Specific combinations of determiners and 
.nodiiying sentences are also bad, as seen in (3.62). 

(3.62) I am aware of (a) j *a | fact that the earth 

(b) ] the \ 

(c) (^*that J 

is round. 

Both (3.62a) and (3.62c) imply that there is more than one fact that 
the earth is round, and are therefore out on semantic grounds. Notice 
that (3.62c) is excluded by a principle we have already considered, 
namely, that narrower specification of an already necessarily unique 
referent is contradictory. Notice that if we substitute claim for 
fact in (3.52), all the readings become acceptable; the reason is 
that claim, unlike fact ^ is ambiguotus between the content of a claim 
and an instance in which that claim was expressed, and the latter 
reading is possible with a and that , for there can be more than one 
occasion on which a claim is given expression. Hypothesis is not 
ambiguous in the way claim is, but it can be safely substituted for 
fact in (3.62), because there can be more than one hypothesis claiming 
that the earth is round. 



There are apparent counterexamples to my hypothesis that a 
unique referent cannot be more narrowly specified with preservation 
of acceptability. Thus, (i) is acceptable, although the sun is a 
unique object and nevertheless sun carries the determiner that . 



(i) Every time I look at that cruel sun, I curse it. 
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The explanation is, however , quite straightforward: preposed 
adjectives like great in the great man .ire ambiguous betvcen a 
restrictive and a non-restrictive reading, and properties denoted 
by nouns are ambiguous in the same way. Thus, if a man is analyzed 
at some level as a X which is a man (as suggested in Bach (1968)), 
the relative clause which predicates a property of X could be either 
restrictive or non-restrictive. I suggest that the acceptability of 
(i) can be explained by analyzing that cruel smi as that X^ which 
is cruel^ and which is a sun ^ an analysis which seems to me correct 
on intuitive semantic grounds as well. 



3.2.3.2. having shown that nouns with sentences in apposition 
satisfy (3.1) we shall now show that they satisfy* (3.2). 
(3.2i) is illustrated in (3.63). 

(3*63) a. I heard the claim that Bill is mad. 
b. *V7hat did I hear that Bill is mad? 

As was explained- in connection with Nuclei of relative clauses , 
(3.2ii) cannot be illustrated with the structure SD, since, the 
maiming of the constituent cSp ND is independently prevented by 
(3.2i) and the Left Branch Condition (in English). 

(3.2iii) is illustrated in (3.61*), 

(3.61*) a. The claim that Bill is mad is odd. 

b. The claim is odd that Bill is mad. 

c. A hypothesis that the number of primes is 

infinite was proposed. 

d. A hypothesis was proposed that the number 

of primes is infinite. 

^ (3.2iv) is illustrated in (3.65). 

(3.65) a. I heard the claim that Bill loves Mary. 

b. *Who did I hear that olaim that Bill loves? 



3.2.1*. We have considered three kinds of complex NP^s in this 
section so far, and have argued that they satisfy both the requirements 
in (3.1) and the predictions in (3.2). There is, however, a kind of g 
construction which, albeit not a complex NP^ is nevertheless a N & S; 



That the construction I am about to introduce is a N fc S (i.e., 
that it satisfies (3.1)) can be shown by using the same kind of 
arguments I have used for NP*s modified by relative clauses or comple- 
ment sentences. The argumentation in this case raises no new problems 
as compared with the already considered cases, and I leave its 
construction to the reader. 



however, this N & S-type is not subject to (3.2iv). Thus, consider (3.66). 
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(3.66) a. I showed you a picture of sfl oi 

b. ThSTTsTand. which I showed you a picture 
of an owner of, is quite large. 

In deriving (3.66b) from a structure which contains (3.66a), the 

^^i^qo:'???!"''^*^"^^^ ^5 *he chopped element is 

J^n^ff ^ of the doubly-underscored NP, Its chopping is, of 
course, allowed by (3.2iii), but since the doubly-underscored KP is 
t^^n^T of f °^ ^^"'^y underscored NP, (3.66b) in effect 
violates (3. 2iv) since it arises through the maiming of the Satellite 
of the singly-underscored NP. 

Construction like (3.66a) were studied by Ross in connection with 
the optionality of Pied-Piping, and he pointed out that no S-node can 
intervene between the NP to be chopped and some higher NP which 
dominates it.' . 



%oss introduces the further restriction that coordinate NP- 
nodes cannot be reordered, in order to prevent (ii) from being ' 
derived from (i). ^ 

(i) I watched Bill end the boy. 
(ii) *The boy Bill and whom I watched was vain. 

Notice however, that the added condition is unnecessary, since 
wh-marking is a feature-changing rule prevented from going into 
coordinations in any case. 



Such constructions have the distinguishing property that they 
can be generated by the recursive application of one rule, such that 
It embeds an NP as complement to a head NP. In other words, given 
the chopped element, the structure of the highest N'i 5 to which it 
belongs is entirely predictable (except for the internal constituency 
of the various head NP's) by one rule. While this property sheds 
little light on the peculiar behavior of the structures which possess 
It and would have to be stated as an ad hoc rider on (3.2iv) so long 
as the latter is regarded as a formal constraint, I shall argue in 
Chapter Four that if (3.2iv) is regarded ^ a performance constraint, 
cases like 13.66) no longer constitute counterexamples; within that 
framework, the inner predictability of the structures at issue will 
emerge as a significant distinguishiiig property. 

3.3. This section is^ broken down into twc main subsections 
devoted to a discussion of optional adverbials modifying sentences 
and verb phrases respectively. That at least those two classes of 
adverbials should be recognized is a fairly well-established point; 
thus. It has often been pointed out that only S-adverbials can be 
preceded by pause- or prepoaed, while only (some) VP-adverbials may 
undergo Passive or belong to antecedents of do so . Additional 
reasons for recognizing this distinction were given in (l.ll8g), and 
I repeat them below in some detail. 
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First, it is a generally accepted assumption that two adverbs 
of the same kind cannot occur in the same clause • Therefore, the 
existence of grammatical sentences like those in (3.67) suggests that 
either the singly and the doubly underscored strings in each example 
are not adverbs of the same type or that they do not belong to the 
same clause; I think that only the latter position is tenable. 

(3.67) a. Johnj slapped his kid^ because he-f had been 

too quiet, becaus e^ hisj w ife . asked himj 
to do that , 

b. John went to the store in order to buy some 
whiskey , in order to antagonize his 
mpther>>in-law . 

The conclusion that the doubly underscored strings belong to higher 
sentences than the singly underscored ones seems zo me rather unavoid- 
able . 

Second, one condition on neutralization to which I know of no 
exceptions is that the neutralizer must command the neutralizee (if 
not in surface structure, at least at some (usually later) point in 
the derivation). Most informants agree that (3.68a) is considerably 
more acceptable than (3.68b). 

(3.68) a. John didn't beat up his wife because he 

ever enjoyed any of that, 
b. *John didn't beat up his wife, because 
. he ever "enjoyed any of that. 

The distinction in graimnaticality between (3.68a) and (3.68b) can be 
readily explained if we assume that in the former, but not in the 
latter, the neutr€d.izer commands the neutralizee, and failure of 
conmand can be achieved in (3.68b) only if the adverbial belongs to 
a higher clause than the main one. 

Third, the neutralization of comparees by comparators can only 
be accounted for by assiuni.ng that comparative and equitative clauses 
are VP-modifiers. The grammaticality of (l.66a) and (l.66b) and the 
ungrammaticality of (l.66c) and (l.66d) shows that such neutrciization 
must be recognized. 

3.3.1.1. Concerning the behavior of S-adverbi€d.s with respect 
to (3.1), if it can be shown that the underlined string in (3.69) is 
a constituent of type S, it will thereby have been shown that (3.69) 
satisfies- (3. li) and (3.1ii). 

(3.69) John left Mary, because he was in love with 

Claire. 

Notice that the first two arguments given in section 3.3 hold only 
if we assume that (3.69) has the structure schematiccOly represented 
in (3.71). but not that represented in (3.70). 
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John left Mary, because he was in love with Claire. 




Indeed, only given a structure like (3.71) do the two adverbs in 
(3.o7a) or (3.67b) belong to separate clauses, and only then does 
Negation fail to command ever or anjr in (3-68b). But if (3.71) rather 
than (3-70) is the correct representation of (3.69) then the underlined 
string in the latter is. indeed a constituent (that it is a constituent 
•of type S hardly requires argumentation, since vhat else could it be?). 

As I pointed out in (l.ll8g), Ross offered two arguments of his 
own in defense of the position that (3.69) involves rather than VP- 
modification. I review them below. 

Ross first argument was that the pause indicated by comma in 
(3.69) is not obligatory, but if we pause at all, the natural place 
for pausing is after the main clause, rather than after John; this, he 
claimed, shows that the immediate constituents of (3.69) are the main 
clause and the adverbial, rather than John and the remainder of (3.69). 
This argument is unconvincing because: (i) pauses between (short 
subjects and VP's are quite bad in general, as shown by (3.72), 

(3.72) *John, left Mary. 

•and (ii) if we do not pause after the main, clause in (3.69), it is 
no longer necessary that the adverbial be a S-modifier, as the singly 
underscored string in (3.67b) shows. 

Rcss* second argument is directed against Langacker (1969), where 
it is claimed that the ungrammaticality of (3.73c) is due to the 
requirement that extraposed clauses be commanded by the constituent 
they cross. Ross proposes that the paradi^ in (3.73) be accounted 
for by his RRC, since Extraposition can also operate from object, in 
^ich case no constituent is crossed. 

(3.73) a. That John is free is surprising because 

criminals belong in jail. 

b. It is surprising that John is free because 

criminals belong in jail. 

c. *It is siurprising because criminals belong 

in jail that John is free. 
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Ross concludes that the adverbial in (3.73a) must have the structure 
represented in (3.71) for the RRC toyield^the correct results. 

As I pointed out in (l.llSg), this argument does not really go 
through, for extraposed clauses can cross non-clausal adverbials, 
as shown in (I.126c). The unacceptability of (3.73) is probably due 
to the fact that rightward movements are fairly bad when crossing a 
sentence, whether the latter is an adverbial or not; thus, (3.7U) is 
no better than (3.73), although no adverbial is involved. 

(3.7U) *I looked the desk which I had received from my 
brother-in-law over . 

An additional reason for rejecting this argument of Ross* is Drovided 
by (3.75). 

(3.75) a. A girl- who was sick jiist left because a storm 

was raging outside, 
b. *A girl just left because a storm was raging 
outwise who was sick. 

(3.75), unlike (3.73a), is ambiguous between a sentential and a verb 
phrase reading of the adverbial. If Extraposition-from-HP were 
constrained by the RRC, a3 Ross claims, we woiad expect (3.75b) to 
be ill-formed on the S-modification reading, but well-formed on the 
VP-modification reading; the fact that (3.75) has no well-formed 
readings falsifies Boss' hypothesis. 

I conclude that the above two arguments of Ross' do not go 
through; nevertheless, I have given other arguments for believing 
that the position he was trying to defend was essentially correct. 

Having shown that S-adverbials satisfy (3.1i) and (3.1ii), it 
remains to show that they satisfy (3.1iii). It seems to me that the 
optionality of S-adverbials is intuitively as clear as that of non- 
restrictive relatives. But since I have already pointed out that 
foolproof operative tests are, in all probability, impossible to find, 
I shall not press the matter any further. Apparent counterexamples 
like (3.76) may be handled by , iring that some featiires of Tense 
and/or Aspect form a con5)on3nv with specific adverbials in underlying 
representation (see McCawley (1971)), and that the discontinuous 
component consisting of Perfect and of the time adverb in (3.76) be 
regarded as an optional dependent of the main clause. 

(3.76) John has been in Columbus since the beginning 

of the term. 

3.3.1.2. That the behavior of S-adverbials confirms the 
predictions in (3.2i) -(3.2iv) is shown by (3.77)-(3.80) , respectively. 

(3.77) a. That John left, because he was sick, is obvious. 

b. *What because he was sick is obvious is that 

John left. 

c. *(lt) because he was sick is obvious is that 

John left. 
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(3.78) a. John left the city, because he. was sick, 
b. The city which John left, bec€.uee he was 

sick, is a real megalopolis • 

(3.79) a. Mary thinks John left the city, because he 

couldn't stand the atmosphere, 
b. It's because he couldn't stand the atmosphere 
that Mary thinks John left the city. 

(3.80) a. John foxmd Mary at the place of the accident • 
b. *The accident which John foxmd Mary at the 

place of was a tragic one. 

3.3.2.1. In showing that optional VP-adverbials satisfy (3.1), 
the main problem is to show that the structure of (3.8l) is 
essentially (3.83) rather than (3.82). 

(3.81) Mary loves Bill as much as she hates Jack. 
• (3.82) 

S 




To begin with, it seems to me that the facts of optionality are 
no more obscure for certain VP adverbs than they are for S-ad verbs in 
generai. Thus, it seems intuitively clear that vhe adverbial clause 
in (3.81) is optional, while the adverb i;i (3.8li) is not, the 
distinction is less obvious, however, for (3.85), where the instrtunental 
adverb is, I claim, optional on the surface, but not in underlying 
structure. 

(3.84) John Went to school. 

(3.85) John cut some bread (with a knife). 
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There is, however, one fact which suggests that the structures 
of (3.81) and (3.85) are different: optional adverbs are subject to 
(3.2iv), but obligatory ones are not, as the contrast in acceptability 
between (3.86) and (3.87) shows. 

(3.86) a. John cut the bread with an edge of his sword, 
b. ?What did John cut the bread with an edge of? 

(3.87) a. John left the city in spite of the owner 

of the palace, 
b. <*What did John leave the city in spite of the 
oimer of? 

This fact can be made sense of if the structure of (3.81) is (3.83), 
but not if it is (3.82). This, together with intuitions about 
optionality, support ray claim that optional VP-adverbials satisfy- 
(3.1). 

3.3.2.2. That optional VP-adverbials satisfy (3.2i)-(3.2iv) is 
shown by (3.88)-.(3 .91) respectively. 

(3. 08) a. That John loves Mary as much as Jack hates 
Jill is. obvious. 

b. **What as much as Jack hates Jill is obvious 

is that John loves Mary. 

c. *(lt) as much as Jack hates Jill is obvious 
that John loves Mary. 

John loves Mcury more than Jack hated Jill . 
It's Mary who John-^loves more than Jack 
hates Jill. 

I suspect John loves Mary as. much as Jack 
hates Jill. 

It's as much as Jack hates Jill that I suspect 

John loves Mary. 
John likes Bill more than the owner of the 
house . 

It's the owner of the house who John likes 
Bill more than.^^ 



(3.89) 



a. 

b. 



(3.90) 



a. 
b. 
c. 
d. 



An intriguing fact, for which I have no explanation, is that 
(iia) and (iib) are unambiguous, although the source, (ia), is 
ambiguous between (ib) and (ic). 

(i) a. John likes Mary more than Bill. 

b. John likes Mary more than Bill likes Mary. 

c. John likes Mary more than John likes Bill. 

(ii) a. ?It^s Bill who John likes Mary more than, 
b. ?Who does John like Metry more than? 
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is a subject. This n e«CS^pu^znn^ "tl l^ " 

(iii) a. John is taller than Bill (is tall). 
^' It s Bill who John is taller than. 

(3.91) a. John likes Mary as :„uch as Bill hates Jill. 

*^°^" as much as 

Bill hates. 

c. John likes Bill more than the owner of the 

nouse* 

d. ^It's the house that John likes Bill more 

than the owner of. 

aie reversJb e sLvrL"n'T'^''' ^-^^ination (in which the terms 

a?e not o r- e^rsib!ef^ ZtZf^'^''' coordination (in which the U>r. 

in a narrow and In fhio ; ^^i^^^^-ic coordination has been defined 

proposed by ZHy ij rid!;::* ""^^ (^^^^^ -e 

instances of coordinaUon^M^h correctly) concerns 

09o^ ^°°^«*^"ation which are necessarily asymmetric such an 

(3.92a), Which cannot be understood synonymously uIJH (3:92b) 

(3.92) a. . John was shot in the chest and then died 

of that wound, 
b. John then died of that wound and was shot 
in the chest. 

A subset of necessarily asyrmnetric coordinations consists of those 
ITIZI^, P"--tion Of gra^ati":?it°; 

(3.93) a. John went and solved the problem. ' 
b. *John solved the problem and went. 

ihus 13.94} is regarded by R. Ukoff 

^^m, i'^T ^^^^ ^" asymmetric coordination, the firit 

o'ver "Ji^^ise'or'^'"* presupposed by the 'second 

'wlnJ^^.!?;! n u presuppose" is rather misleading, for 
presupposition" has been generally used in the literature to refer 
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to those assumptions definable on purely linguistic grounds; a more 
adequate term in this case might be "invited inference", as defined 
in Zwicky and Geis (1971). For a discussion of some sloppy uses 
of the term "presupposition", see Garner (1971). 



as either an instance of symmetric conjunction of poor acceptability, 
or an instance of asymmetric conjunction based on the assumption 
that John owns a Ford. 



R. Lakoff is careful to point out that the predicate CAUSE or PRECP:dE 
holds between (sometimes very elaborate) inferences which can be 
made from the coordinate terms, and not simply between the coordinate 
terms, as suggested in Postal (1971). That the latter proposal 
cannot be correct is shown by asymmetric coordinations like (3.95). 



Clearly, it is absurd to assume that the fact that the boy in 
question is dead now preceded John^s feelings of happiness in the 
morning; rather,- the event which preceded and caused John's happiness 
is "the boy who cursed John died", and the latter can only be 
inferred from the first conjunct in X 3. 95). 

The various writers who have concerned themselves with coordina- 
tion noticed that it is not easy to distinguish between symmetric 
and asymmetric coordination formally. The only difference in 
surface structure (pointed out by R. Lakoff) is that, given a sequence 
of more than two coordinate terms , the latter must be grouped by 
two if they are asymmetric, while this is not necessary when they 
are symmetric. In other words, given a string like (3.96), its 
phrase structure can be either (3.97) or (3.98) on a symmetric 
interpretation, while it must be (3.98) on an asymmetric one. 



(3.9^) . Fords can go fast, and John Just got a ticjcet 
for speeding. 




(3.95) The boy who cursed him is dead now, and John 

was revoltingly happy throughout the morning. 



(3.96) Every day, John eats, drinks and goes to work. 



(3.97) 




(3.98) 



VP 




However, this distinction is neutralized in binary coordinations, 
whose surface structure is (3.99), regardless of symmetriclty . 




The existence of the boxed-X-constituent was convincingly 
defended in Ross (196? ), but I think there are good reasons for 
regarding the boxed X as different in kind (in the sense of 
Appendix Two) from the circled X's, the main reason being that the 
boxed one cannot occur alone — ^whether it is a S, a NP or a VP— while 
the circled ones can. 

The interesting feature of (3.99) for our purposes is that both 
kinds of coordination appear to be N &'S*s. Indeed, Xq directly 
dominates X^, both nodes belong to the same category, and the boxed 
X is optional, because Xq and X^ are categorical nodes of the same 
kind. Moreover, since the boxed X cannot occur alone, there is a 
principled reason for regarding Xt as the Nucleus and the boxed X 
as the Satellite of Xq. 

I am well aware that my claim that coordinate structures are 
H & G^s is rather startling, as \t flies in the face of most 
syntactic analyses, which cons^ r coordination and subordination 
as entirely distinct and unrei jd structures. In the remainder of 
this chapter, I shall try to show that my proposal is more than mere 
terminological jugglery, made possible, perhaps, by insufficiently 
strong constraints placed on the definition of N & S by (3.1) . It 
should be clear, however, that I do not claim that coordination 
and subordination are indistinguishable, but merely that they share 
more properties than meet the eye. Thus, my claim that coordinations 
are N & S's is not different in spirit from the claims made by a 
number of writers to the effect that verbs, adjectives, prepositions, 
etc., all belong to the larger category 'predicate*; obviously, none 
of these writers meant that no distinction is necesseoy between the 
various kinds of predicates, but only that their common features 
must be recognized. 

3.^.1.1. We begin our discussion with asymmetric coordination 
I suggested earlier that in (3.99), X^ should be regarded as the 
Nucleus and the boxed X as the Satellite because the former can 
occur alone while the latter cannot. However, we need to find 
stronger criteria for identifying the Nucleus and the Satellite, since 
(a) there are languages without an overt morpheme an d , in which 
conjunction is expressed by juxtaposition; (b) the first coordinate 
term may be introduced by both or either, and it is no longer obvious 
then that it is optional; (c) (3.1iii) requires that optionality be 
proven with respect to both surface and underlying structure, not 
Just the former. In view of these facts, it seems preferable to 
choose the Nucleus and the Satellite between X^ and X2, rather than 
between the former and the boxed X (in keeping with the suggestion 
made in Chapter One that adverbials — and, by extension, dependent 
terms in general— -be ambiguously defined in such a way as to include 
or exclude the connector) . 



r 
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c^''''^'/^ *° and Xp, we find that there 

are semantic reasons for regarding the- former as the llucleus and 
the latter as the Satellite in asymmetric coordination,- since the 
latter, but not the former, is semantically optional. This is so 
because Xi or its negation is 'presupposed* (in the sense of R. 
Lakoff Csee footnote ID) by X2 in asymmetric conjunction or disjunction 
respectively while Xp is not presupposed by Xi. To see this, 
consider (3.100) and (3.101). 

(3.100) Two and two are four, and I'll fail you for 

writing that two and two are seven. 

(3.101) You will leave my room, or I'll shoot you. 

The meaning of the second conjunct in (3.100) is incomplete without 
the presupposition that two and two are four, for, without it, my 
decision to fail you appears arbitrary (we must assume an arithme- 
tically naive speaker-listener, who needs to hear the first conjunct 
in order to be aware of the presupposition). Similarly, if I utter 
only the second disjunct in (3.101), part of its meaning will be 
lost , for It IS not the same to make the threat that I will shoot 
you and to say that I will do so only if you don't leave my room 
Ithe biconditional inference is in fact an invited one, as shown in 
Geis and ^wicky). In contradistinction, the meaning of the first 
term is complete without the second, for two and two are still four 
even if you write that they are seven on a test, and the only thing* 
I am asking you to do in (3.101) is to leave the room, regardless 
of what I intend to do in case you don't. The point can perhaps be 
made clear by considering the analogy of antecedents and pro-forms. 
A pro-form presupposes an antecedent, but an antecedent does not 
presuppose a pro-form. Thus, in John told Mary that he had a son , 
part of the meaning of he is lost if we don't know it stands for 
^£h£., while no part of the meaning of John is lost if we don't know 
that the same person is later referred to with he. Thus, the 
optionality of the second tern in asymmetric coordination (Xo or 
the boxed X in (3.99)) suggests that this type of construction is 
similar to a main clause modified by an adverbial clause. 

A second reason for regarding second asymmetric coordinate 
terms as similar to adverbials is provided by the behavior of 
backwards pronominalization. Thus G. Lakoff (1968) noticed that 
backwards pronominalization can proceed from adverbial clauses into 
main ones if the pronoun is a non-subject, but not if it is a 
subject,- as (3.102) and (3.103) show. However, as the distinction 
in acceptability between (3.103a) and (3.103b) shows, the length of 
the second term is important, as it determines the level of phonetic 
stress on the antecedent, and the result is acceptable only if the 
antecedent is weakly stressed. On the other hand, as can be seen 
m 1 3. 102a) no amount of stress reduction can save backwards 
pronominalization into a subject. 

(3.102) a. *Hei was hit by Mary, before Johnj had the 
slightest chance to get up and leave, 
b. •He^ was hit by Mary, before John^ left. 



(3.103) a. Mary hit him^, before John^ had the slightest 
chance to get up and leave, 
b. «r^ary hit himj, before John^ left. 

The interesting fact is that asymmetric conjunction behaves exactly 
like main clauses modified by adverbial clauses with respect to 
backwards pronominalization, as the perfect parallelism between the 
paradigm m (3.I0M, (3.105) and that in (3.102), (3.IO3) shows 
Ut IS, of course, necessary to put an asymmetric interpretation on 
the sentences in (3.IOJ1) and (3.105). 

(3.10ii) a. «He^ was killed by Mary, and Johnj was buried 
with full honours in the family mausoleum, 
b. «IIe^ was killed by Mary, and Johnj was buried. 

(3.105) a. Mary killed himi , and Johni was buried with 

full honours in the family mausoleum, 
b. «Mary killed himj, and Johnj was buried. 

The unacceptability of (3.10l*a) can only be explained if we 
assume that a first asymmetric conjunct is in some sense similar to 
a main, rather than a subordinate, clause. It follows therefore 
that the suggestion made in Postal (1971) to the effect that asymmetric 
coordination is derived from subordination, in the sense that the 
terms are arguments of some predicate like PRECEDE or CAUSE, is 
definitely wrong. This is shown quite clearly by (3.IO6), since. 
If the b-5entence were indeed derived from the a-one, there should 
be no difference in acceptability between the two. 

(3.106) a. That she^ was shot in the chest brought it 

about that Maryi eventually died, 
b. *Shei was shot in the chest, and Maryj 
eventually died. 

A third reason for regarding asyTnmetric coordinate structures 
as N & G s is their behavior under maiming. Indeed, maiming a first 
asymmetric coordinate term is infinitely more acceptable than 
maiming a second one, as predicted by (3.2ii) and (3.2iv), and as 
shown by (3.il07)-(3.109).l2 The contrast between the b- and the 



If some of the b-sentences are not fully acceptable to all 
speakers , this is probably due to the fact that there are more • 
explicit and less ambiguous ways of indicating subordination than 
through asymmetric coordination (where subordination is usually 
discovered through elaborate inferences from extralinguistic 
assumptions), and the sentences in (3.107)-(3.109) violate to some 
extent Grice»s conversational principle that statements should be 
as clear as possible; in other words, it may be harder for a hearer 
to realize that the sentences In (3.107)-(3.109) involve N & S^s 
then in cases containing explicit marks of subordination. 
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c-sentences in (3-107)-(3-109) can be explained neither by the CSC 
nor by Ross* claim that asymmetric coordination is exempt from 
the CSC* 



(3.107) a* John murdered the girl, and her father 

(subsequently) suffered a heart attack* 
b* ?The girl who John murdered, and her father 
(subsequently) suffered a heart attack, 
was ray cousin Alice* 
c* *The heart attack, which John murdered the 
girl, and her father (subsequently) 
suffered, was a fatal one* 

Mary is beautiful, and Bill is happy to 

be married to her* 
?Beautiful though Mary is, and Dill is happy 
to be married to her, I definitely like 
Sheila better* 
*Happy though Mary is beautiful, and Bill 
is to be married to her, I definitely 
like Gheila better ♦ 

(3.109) a* Bill will have to eat the sandwich, or 

his mother will tell uncle Joe* 
b. The sandwich which Bill will have to eat, or 
his mother will tell uncie Joe, is a 
fairly large one* 
i^j^ c* *Uncle Joe, who John will have to eat the 

sandwich, or his mother will tell, is a 
rather unpleasant fellow, 

A fourth reason for regarding asymmetric coordination as N & r> 
is that they obey the following principle: 

(3.110) If an element of a Satellite is removed from 

the containing N & S, that element must, also 
exist in and be removed from the Nucleus 
(under certain conditions which remain to be 
made precise)* 13, Ik 



(3*108) a* 
b. 



An explanation for (3.110) and a refutation of an implied 
counterproposal made in Neubauer (l97O0 are offered in Appendix 
Four* 



I have not investigated in detail the exact conditions under 
which across-the-board maiming can occur* However, it seems clear 
that the identical elements must be in •similar positions since 
two elements cannot be subject and object respectively, as shown 
by (i). 
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(i) a. Johrij^ likes Mary and (therefore) Bill hates 
hinij^, 

b. *It*s John who likes Mary and (therefore) Bill 
hates . 

On the other hand, the two elements need not be objects of the same 
kind, as demonstrated by (ii). 

(ii) a. John loves Mary^^ but can never agree with her., 
b. It's Mary who John loves but can never afrxee ^ 
with. 

If across-the-board maiming depends on the parallelism of the 
positions of the moved elements, the parallelism of the larr;er 
structures from which the elements in question are removed is 
also a pertinent factor. Thus, while across-the-board maiming of 
coordinate structures of any kind seems to be universally acceptable 
(subject to the restriction mentioned at the beginning of this note), 
across-the-board maiminR of matrices and complements, as in (iii), 
are marginal to some speakers, and so are (3.111d) and (3.112d). 

(iii) a. John told Mary that he loves her. 

b. It's Mary who John told that he loves. 

The marginality of these sentences is presumably due to the fact 
that the structures containing the identical elements do not belong 
to categories of exactly the same kind. When the containing 
structures belong to clearly distinct categories, as in (iv), where th 
categories are S and NP respectively, the result is considerably 
worse . 

(iv) a. The doctor cured John in spite of himself. 
b.?*It's John the doctor cured in spite of. 

A strange effect of parallelism is that it seemr? to improve 
even structures which would otherwise be unacceptable, (v) :shows 
that violations of the CNPC are less «overe when they occur across- 
the-board . 

(v) a. *It's John who I believe the claim that you killed. 
b.?*It's John who I believe the claim that you killed 
and disbelieve the allegation that you tried 
to help. 



(3.110) is a formalization of an observation made in Ross 
(1967, sec. U.2), and illustrations are provided in (3.111) and 
(3.112). 

(3.111) a. John likes Mary, more than Bill hates ^jiiij^ 




I 

] 
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(3.111) b. It's Maryj^ John likes more than Bill hates 

riieri \ 

c. "It's Pill> John likes Maryi more than 

Bill hates. 

d. It's Mary John likes more than Bill hates, 

(3.112) a, John tore the rug. in rolling 

|iti up ^ i 
Cup the curtain} 
It's the rug, John tore in rolling 



jlti up 1' 
Cup the curtainj 

(the curtain\ j^^n tore 
Uhe rugi J 



c. *It's A ^"^ uuxoa^u ^ j^^^^ ^^^^ Ahe ruA 

in rolling up. 

d. It's the rug John tore in rolling up. 



That asymmetric coordination behaves in the same way as sentences 
modified by adverbial clauses with respect to (3.110) can be seen by 
comparing (3.113), (3.111*) with (3.111) and (3.112). 

(3.113) a. John murdered Alice, and Bill swore to 
avenge hex*. 

b. Alice, who John murdered, and Bill swore 

to avenge her death, was a very nice /rirl* 

c. *It's |^J;^^J' John will have to marry 



d. It's Alice John will have to marry or let go. 

3.1*. 1.1. 2. Regarding asymmetric coordination and (3.2), (3.115) 
shows that (3.2i) is entirely confirmed. 

(3.115) a. I told you that John murdered Alice, and 

her father subsequently suffered a heart 
attack . 

b. *What I told you, and her father suffered a 

heart attack, is that John murdered Alice. 

c. I told you to get out, or I'll hit you. 

d. *What I told you, or I'll hit you, is to get 

out. 

The conrirmation of the predictions in (3.^-ii) and {3.Jiiv) was 
discut;3ed in 3J*.1.1.1 and illustrated witli {3.loY)~(3.]^^y) ^ 

The prediction in (3.2iii) is not confirmed in Knr.lish as there 
are no convincing examples of second asymmetric coordinate terms 
being moved ^ with or without the connector. I do not, however. 
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regard this fact a« particularly damaging to n\y analysis , since 
instances of 'split con,luncts' have been uncovered in other lan;raap:es, 
and, as I pointed out already, there is no guarantee that the option 
offered by staten>c-nts like (3.2ii) and (3,2iii) will be taken in 
each and every language. The instances of split conjuncts which I 
have come across in the literature involve symmetric conjunction, 
and will be taken up in section 3.^.2.2, I see no reason for 
believing that conjunct-splitting is limited to s^inmetric conjunction, 
but since I do not know of any discussion involving asyminetric 
conjunct-splitting in other languages (particularly when asymmetry 
is not decidable by strict linguistic criteria) I will not purs^r^.e 
the matter any further. 

The absence of .split conjuncts (and coordinate terms in general) 
in English is, I believe, due to a phenomenon of contamination of 
the restrictions on symmetric and and or to all the forms which 
superficially resemble those coordinating conjunctions. Thus, there 
are instances of and which, on strictly linguistic grounds, are 
obviously not conjunctions (see also 3.1*. 1.2), such as in (3.1l6a), 
which is in all probability, related to, or even derived from, 
(3.1l6b); nevertheless, the constituent introduced by and cannot 
move, as shown in (3.117a), unlike the constituent introduced by 
to^ which can, as shown in (3,117b), 

(3.116) a. You will have to try and finish your paper, 
b. You will have to try to finish your paper, 

(3.117) a, *What you will have to try is and finish 

your paper, 
b. What you will have to try is to finish 
your paper. 

The behavior of this k'nd of and parellels symmetric and with respect 
to copying as well; thus (3.1l8a) is marginal compared with (3.1l8b). 

(3.118) a,??What you will have to try and do is finish 

your paper. 

b. \Vhat you will have to try to do is finish 
your paper. 

(3.1l8a) seems to have the same degree of acceptability as (3.119b), 
where we are dealing with symmetric and ; the restriction is probably 
the same as that vhich marks as marginal conjunctions of indefinite 
and definite forms in genereil, as shown in (3*120). 

(3.119) a. Mary tried to laugh and to dance. 
b.??What Mary tried to laugh and to do was 

to dance* 

(3.120) a.7?Someone and Bill walked in. 
b.??A cat and the dog ran away. 
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It is interesting that this contamination phenomenon does not 
extend to but in the sense of except ; thus, (3.121) strikes me as 
fairly tolerable* 

(3.121) a. You've been talking about nothing but John, 
b. John, you*ve been talking about nothing but* 

In conclusion, even though the absence of confirmatory evidence 
for (3.2iii) is an embarrassing fact, it is probably not; a genuine 
counterexample . 

3.U.I.2. Before leaving the topic of asymmetric coordination, 
we shall consider a subclass of asymmetric conjunction which 
apparently violates (3.2iv), and which led Ross to the claim that 
asymmetric conjunction is exempt from the CSC. The phenomenon is 
illustrated, in (3.122) and (3.123). 

(3.122) a. 1*11 have to try and finish that book 

before dark, 
b. Vihat will I have to try and finish 
before dark? 

(3.123) a. John went to the store euid bought some 

whiskey. 

b, It^s some whiskey, that John went to the 
store and bought. 

I believe, however, that the conjunctions in (3.122) and (3.123) 
are not N & S^s, since neither term is semantically optional; thus, 
we seem to be dealing with two stages of one event , so that each 
term presupposes (in the sense of R. Lakoff ) the other. Indeed, 
in (3.107a), there is no necessary connection between the girl's 
death and the heart attack her father suffered (John might have 
committed the murder for the purpose of bringing about the attack, 
but the sentence is non-specific on that score). In contrast, in 
(3.123), there is a strong suggestion that the two conjuncts 
express stages of the same event , and both, consequently, presuppose 
each other; the coordination is thus not an N & S, for (3»liii) is 
not satisfied. 

The suggestion (or invited— inference) that we are dealing with 
one event is due, I think, to the fact that the two conjuncts have 
the same subject and to the fact that the second occurrence of that 
subject is reduced. Consider the gradual increase in acceptability 
exhibited in the following paradigm: 

(3*12^) a. *rhis is, the whiskey which John^ went to the 
store- and Paul bought. 

b. ?*This is the whiskey which John.- went to the 

store and he^^ bo>aght. 

c. ?This is the whiskey which John^^ went to the 

store and he^ bought. 

d. This is the whiskey which John went to the 

store and bought. 



In (3.12l*a), the subjects are different; in (3.12Ub), the subjects' 
are the same, but the second one is unreduced; in (3.12Uc), the 
second cubject is partly reduced, while in (3.12l*d), it is 'totally 
reduced • This paradigm shows that the stronger the invited ^ 
inference to the ^effect that the two conjuncts refer to parts of 
the same event, the easier it is to maim the second. 

It is noteworthy that mutual presupposition—as in (3.12Ud)" 
has the same effect on certain purpose adverbs as oh second conjuncts 
both- cease to be Satellites (and therefore become maimable). Thus, 
(3.125) shows the same increase in acceptability as (3,12l*K 

(3.125 )-''5a. *This is the whiskey which John went to the 
store in order that Paul may buy. 

b. *?This is the whiskey which John went to the 

store in order that h$ may buy. 

c. ??This is the whiskey which John went to the 

store in order that he may buy. 

d. This is the whiskey which John went to the 

store in order to buy. 



^Poutsjaa (1929) and Ross (1967) have in fact proposed that 
(i) be derived from (ii). 

(i) John went to the store and bought some whiskey, 
(ii) John went to the store to buy some whiskey. 

Later writers (e.g., Zwicky (I969)) have pointed out, however, that 
(i) and (ii) are not synonymous, for the latter , but not the former, 
allows the continuation • . . but he didn't buy anything . 

Another difference between (i) and (ii) was pointed out in 
Schmerling (1972): only from the latter, but not from the former, 
is it^ possible to infer that the going to the store must have been 
undertaken with the intention of buying the whiskey, as shown by the 
contrast between (iii) and (iv). 

(iii) This is the whiskey which John went to the store' 
and bought, although he had no intention of 
buying anything, 
(iv) ^his is the whiskey which John went to the store 
to buy, although he had no intention of buying 
anything . 



Just as the ideal situation for conjunct-maiming is one in which 
the conjuncts are VP*s, the ideal situation for purpose-adverbial 
maiming is one in which the adverb is a VP-modifier. To see this 
even more clearly, contrast (3.125d) and (3.125e) where the purpose 
adverbials are VP- and n-modifiers respectively. 

(3.125) e. *Thi5 is the whiskey which- John went to 
the store, in order to buy, 
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In Partee (1971), it is argued that Conjunction-Reduction is 
sometimes associated with changes in meaning and the paradigms 
in (3.123) and (3.12U) support her contention. However, (3.125) 
sugge^its that the facts Partee discusses hinge on VP-modifi cation 
(as against NP~, etc., modification), rather than on 
Conjunction-Reduction ; this is also suggested by (3.126). 

(3.126) a. John wants Cto go to the store and to 
buy some whiskey] . 

b. *This is the whiskey which John wants Cto 

go to the store and to buyD. 

c. John wants to Cgo to the store and buy some 

whiskey] . 

d. This is the whiskey which John wants to 

Cgo* to the store and buyD. 

Ths difference in acceptability between (3.126b) and (3.126d) 
suggests that the bracketed constituent in the former is a coordinate 
S-node , while the bracketed constituent in the latter is a coordinate 
VP-node . This suggestion is supported by the faco that (3.126c), 
but not (3.126a) allows the continuation and Mary wants to do 
so too , as veil as by the fact that the pseudo-clefting of the 
bracketed string in (3.126a) does not leave behind a d£ pro-form 
(or an underlying action verb do_, according to Ross (l9Tl)), while 
the pseudo-clefting of the bracketed string in (3.126c) does, as 
shown by (3.126e) and (3.126f) respectively. 

(3.126) e. What John wants is to go to the store 
and buy some whiskey, 
f • What John wants to do is go to the store 
and buy some whiskey. 

The fact that VP-modi float ion behaves alike with respect to 
coordination or subordination is an additional reason for believing 
the claim I made earlier that the distinction between these two 
construction-types is not nearly as hard and fast as it had 
generally been assumed. 

Finally, it should be pointed out that Ross* contention that 
sentences like (3.122a) and (3.123a) are exempt from the CSC is 
only partly true. Indeed, the conjuncts can be maimed, but not 
chopped, as discussed in 3.^.1.1.2 and illustrated in (3.11Ta). - 

3.^.2.1. In 3.^, I pointed out that coordinations of either 
type do satisfy (3.1) and should therefore be regarded as N & S's, 
with as the Nucleus and the boxed X as the Satellite. In 
3.^.1.1.1, I pointed out that a deeper insight may be achieved by 
considering the roles of X^ and X2, rather than of the former and 
of the boxed X. In thii; irection, we consider and Xp in 
symmetric coordinations and ask how, friven the N & H hypothenin, 
the Nucicu:; und llie ;ialeilile can be identified on a nemantic 
basis. 

From the definition of symmetric coordination, it follows that 
the relative surface position of Xi and X2 is semantically irrelevant. 
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?!^t r • ^i^"' ^^^^ "° °" ""^^i^g' therefore. 

Semitic ^J^sJ? " dominated by the boxed X in (3.99) is without 
semantic significance. To determine which node is the Satellite we 
must ask Which one satisfies O.liii). The answer is that neither 
term is optional in an absolute sense, as shown by the syntactic 
and semantic deviance ofTsT^b). but both are optiona^^n a 
I£iaUv£. sense as shown by the acceptatility of both (3.127c) 
^ optionS!' ^^"^ the other one 

(3.127) a. John and Bill left. 

b. *Left. 

c. John left. 

d. Bill left. 

Given the evidence, the only status that can be ascribed to 
symmetric coordination is the one stated below: 

(3.128) Symmetric structures like (3.99) are bi- 

directional N & S's . in the sense that 
Xi is the Nucleus and Xg the Satellite, 
and vice versa. 

.v.i«^' Z ^^^^i ^3-^28) will make it possible to 

explain the syntactic behavior of symmetric cordinations with respect 
to chopping rules without recourse to additional assumptions. 
However. I believe that (3.126; is not merely a trick which makes 
the grammar work, but a statement which makes sense intuitively. 
(3.128) may give pause at first, as it may seem self-contradictory 
to call something a Nucleus and a Satellite at the same time; 
however. Nucleus and Satellite are not incompatible properties 
tS'N f f r^^^ ^ Nucleus is its ability to represent 

^"^^^^^ dependents, and what m'les it a 
oatellite is its ability to disappear without altering the 
construction-type of the larger constituent to which it belongs; as 
far as I can see. both properties ^re satisfied by symmetric 
coordinate terms. It is important to notice that these properties 
refer to potential, not actual situations; indeed, there is no 
coordination token in which a term is both present and absent. 
But surely (3.128) does not assert that any such coordination token 
does exist. The possible impression of self-contradictoriness 
derives. I think, from the possible mistaken feeling that (3.128) 
does in fact make such an assertion. 1° 



(3.128) can be extended trivially to n-term coordination. 
Since only one term must be present, but any of the terms could be 
tnat term, a symmetric coordination with n terms in a N & S in n 
wars, such that in each way. some term is the Nucleus and the 
remainder are Satellites. ^ y 
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I can see at least foiir reasons for accepting (3,128): 

(a) Constructions are either endocentric or exoceritric; 
(3.99) is not exocentric, for Xg is necessarily of the same type 
as either or Xg; therefore, T3.99) must be endocentric, 

(b) If (3»99) is endocentric, and we reject (3.128), we may 
regard both X^ and X2 as Nuclei; in other words, symmetric coordinate 
structures (with n terms) would be 'poly-endocentric^ This hypo- 
thesis has some intuition appeal, and in fact represents the 
traditional position on coordination* However, this position is 

not without difficulties. First, Nuclei (and Heads in general), are 
never optional; therefore, endocentric constructions would require 
an ad hoc statement to the effect that if there is more than one 
head present, only one is obligatory. Second, the behavior of 
symmetric coordination with respect to chopping, etc., rules, does 
not follow from (3.2), Indeed, if symmetric coordinations are* 
pluri-nucleic, no term can move due to (3.2i). But (3.2ii) 
predicts that Nuclei c^ be maimed, and in order to make the theory 
conform to the facts, it would be necessary to regard symmetric 
coordinations as mono-nucleic, the totality of the terms being 
the Nucleus; in that case, 'maiming the Nucleus' could be interpreted 
as 'maiming all the terms across-the-board, but in no other fashion' , 
which is precisely what we find. This 'solution' is, however, 
untenable, because the characterization of a construction as 
simultaneously pluri-nuclei and mono-nucleic is clearly contradictory. 
Therefore, it would be necessary to include the CSC in the grammar in 
addition to (3.l) and (3.2), This decision would be unfortunate 
^in the extreme, not only because it would add additional apparatus, 
but primarily because it would claim that Nuclei behave differently 
in N & S's and in poly-endocentric constructions . 

I conclude that (3.128) is to be preferred to the poly- 
endocentric hypothesis for the reasons expounded above. In addition, 
it should be noticed that my prgposal does not really conflict with 
the traditional view; in fact, it includes the latter , since it 
regards each term as a Nucleus, but it adds the information (which 
the traditional position misses) that each term is relatively 
optional, and thus makes possible a principled explanation of the 
behavior of symmetric coordination with respect to both chopping 
and maiming, 

(c) If we reject both (3.128) and the poly-endocentric 
hypothesis, it might be proposed that both X and X2 are dependents 
of some third element; as no such element exists, this hypothesis 
is untenable. To propose that both terms are Satellites would make 
even less sense, for it would then be predicted that both are 
simultaneously optional. 

(d) Finally, it might be proposed that X^ is the Nucleus 
and X2 is the Satellite but not vice versa (as in asymmetric 
coordination); this would be not only wrong on semantic grounds, 
but would also make false predictions on the behayior of symmetric 
coordination with respect to chopping, feature-changing, etc., rules • 
On the other hand, given (3.128), (3*2) and (3*110) make exactly the 
correct predictions, as we shall see in the next section. 
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3.k.2.2. From (3-128) and from (3-2i) and (3-2iv), it follows 
that neither nor Xg in (3.99) can be either chopped or maimed. 
Thus, the stronger constraints on symmetric coordination, as 
compared with other construction types, are correctly predicted 
The conjunction of (3.128) and (3-2i)~{3.2iv) is almost equivalent 
to the CSC, but not quite, since the latter predicts that the boxed 
X m (3.99) is also frozen, while the former makes no such prediction 
In fact, my proposal leaves open the possibility that the boxed X may 
migrate from under Xq. 

It is not clear to what extent this possibility is realized in 
English. Ross (1967) mentions two rules which he regards as counter- 
examples to his CSC, Conjunct-Insertion and Conjunct-Movement, 
illustrated in (3.129) and (3.130) respectively. 



(3.130) a. John and Bill are similar, 
b. John is similar to Bill. 

(3.129) is a counterexample only to the strongest form of the 
CSC, which precludes all movements of nodes dominated a coordinate 
node, but not to a weaker form which allows rearrangements within the 
domination of the coordinate node; it appears that Ross upheld the 
strong version. (3.129b) does certainly not violate my (3.2), but 
It does not support ray (3.2iii) either, as the latter allows move- 
ments of the boxed X even outside the domination of the coordinate 



(3.130b) does supj>ort (3.2iii) as against the CSC; unfortunately, 
the existence of a transformational relation between (3.130a) and 
(3.130b) has been cast into serious doubt by Quang (1970), who 
pointed out that the alleged rule of Conjunct-Movement sometimes 
fails to preserve meaning and/or grammatical well-formedness or ill- 
formedness. But even if (3.130a) and (3.130b) are not related trans- 
formationally, there are sentences in other languages which suggest 
that the constituent formed by a second coordinate term plus the 
coordinating conjunction can be moved. 

One such language is Hungarian, where the proposition the 
teacher se es the boy and a girl can be rendered in any of the three 
following ways: 



(3.129) 



John will get a bonus on Friday, and he has 
been waiting for that since December. 

John, and he has been waiting for that since 
December, will get a bonus on Friday. 



node. 



(3.131) 



a tanaar laat ja a fiuu ees ed* laafit 
the teacher sees the boy and a girl 



(3-132) 



a tanaar a fiuu ees ed* laedlt laat. 
the teacher the boy and a girl sees 



(3.133) 



a tanaar a fiuu laat j a ees ed* laafit 
the teacher the boy sees and the girl 
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I do not know enough about Hungarian to see whether there is enough 
independent motivation for a Conjunct-Extraposition rule, but 
notice that the above data are consistent with such a rule, which 
would derive (3.133) from (3.132), along the lines permitted by 
(3.2iii).-'-i 



Koutsoudas' account is slightly different: he proposes' 
that (3.133) is derived from (3.131) by fronting the first conjunct. 
His reasons are that the verb agrees in definiteness with the 
closest conjunct in. both (3.131) and (3.132), and that the fact that 
It agrees with the first conjunct in (3.133) would require an ad 
hoc proviso if Agreement applied after the splitting of the conjuncts, 
when both are equidistant from the verb. However, it seems 
plausible to expect the possible governors in object-verb agreement 
to be a fiuu and ed' laant . rather than the former and ees ed' laant . 
If so, we may allow the second conjunct to move along with the 
coordinating conjunction, and to permit Agreement to operate after 
this movement, for the first conjunct is still closer to the verb 
than the second conjunct is. ■• 



Another language in which the boxed X of (3.99) appears to be 
movable is Samoan. Grinder (1969) points out that the proposition 
the girl c uts the boy with the axe and the knife can be rendered in 
one of the following two ways: 

(3.13'*) e sogi e le teine le tama i le to*i ma le naifi . 

cuts the girl the boy with the axe and the knife 

(3.135) e sogi ma le naifi e le teine le tama i le to'l. 

cuts and the knife the girl the boy with the axe 

It seems that the underscored constituent in (3.13U) has been moved 
in (3.135) for topicalization purposes. 

I do not know of any other cases in the literature which have 
been presented as violations of the CSC, but the cases which I know 
of all involve the movement of the boxed.X in (3.99), which supports 
my (3.2iii), since that node can only be viewed as a Satellite. 

If the conjunction of (3.2i), (3.2iv) and (3.128) explains the 
impossibility of either chopping or maiming one of the terras of a 
symmetric coordination, the conjunction of (3.2i), (3.2iv), (3.128), 
and (3.110) explains why across-the-board maiming of symmetric 
coordinate terms is possible, and why it is in fact the only way in 
which symmetric coordinations may be maimed. Indee'd, since eac)i 
term is a tJatelliteof the other, neither can be maimed without the 
other's being maimed too. The same principle applies to n-term 
coordinations where any term is a Satellite of every other term (see 
footnote 16). 

I have argued in this chapter that the conjunction of (3.1) and 
(3.2) improves both observationally and descriptively on the IC, as well 
as on the island constraints stated in Ross (196?) . In the next chapter, 
I shall argue that my proposal, but not the IC, can receive a natural 
explanation in terms of general perceptual principles. 



CHAPTER IV 

INTERRUPTIOH, CONFLICT, CLOSURE, AND ISLAND CONSTRAINTS 



'♦.O. This chapter is an attempt to provide a plausible 
explanation of Ross' most important island constraints in terms of 
principles discussed in Chapter Two and of their partial reanalysis 
defended in Chapter Three. 

The constraint (3.2i) which freezes Nuclei (henceforth; the 
Frozen Nucleus Constraint) is discussed in section where it is 

shown that violations of it can yield two types of situations which, 
it is suggested, can be accounted for by a closure and conflict 
principle respectively. 

The constraint (3.2iv) which precludes movements of elements 
of Satellites outside the scope of the N & S node (henceforth: the 
Subservient Satellite C onstraint ) is discussed in section 1*.3, where 
it is argued that violations of this restriction give rise to 
perceptual conflict. 

The Dichotomous Behavior Principle (stated in (1.52)) is discussed 
in section k.k, where it is suggested that the distinction between 
the syntactic phenomena which obey island constraints and those 
which do not can be accounted for by assuming that the remote 
relations crucially involved in the two situation-types are 
recognized by essentially different types of strategies. 

The Sentential Subject Constraint is discussed in section U.2 
within the more general framework of the mitigating effect of 
predictability on the complexity of interrupted behavior. In 
'contrast to Ross' conclusion that the SSC is not universal, it is 
suggested here that the pertinent constraint (resulting from the 
lower predictability of the location of a partner in a discontinuous 
relationship) can apply non-vacuously only in languages where 
arguments can occur on both sides of the verb, and where moreover one 
of these positions confers upon some argument a distinguished status 
(such as in the SVO languages). 

In view of the fact that violations of island constraints are 
not always obvious from a consideration of the surface structure, it 
is necessary to assume that c ?rtain decoding operations take place 
earlier than others. Section ;.l examines some experimental and 
internal evidence which supports the view that some (but not all) 
decoding operations apply sequentially, and considers the principles 
which may govern the ordering of such operations. It is argued that 
the way in which strategies must be ordered for independent reasons 
is sufficient to account for those instances of island-constraint 
violations which are not directly observable in surface structure. 
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h.l. In Chapter Two, I pointed out that the theory of perceptual 
strategies assumes that the processes it posits are not direct 
replicae of the rules of a transformational grammar ^ even thouf^h it 
is very likely those processes utilize transformational information . 
In this section, I shall attempt to determine the principles which 
govern the ordering of strategies, and demonstrate their usefulness 
for the exploration of specific constraints. 

Discussions of ordering have in genleral been limited to syntactic 
or phonological transformations; in this respect, the constraining 
power of extrinsic rule ordering (an explicit statement of the order 
in which rules must apply) has been contrasted with intrinsic rule 
ordering (an inherent property of pairs of rules, to the effect that 
if both rules apply in a derivation, there is a unique possible order 
of application dictated by their formulation) (Chomsky (1965)). Later 
writers pointed out that extrinsic rule ordering is insufficiently 
powerful to account for newly discovered data, so that ordering 
paradoxes result unless the grammar is provided with additional 
apparatus, such as (arbitrary) markers, which code information 
available at some stage of a derivation for the benefit of later 
stages of that derivation (as proposed in Kenstovicz and Kisseberth 

(1970) , Postal (1970a), Koutsoudas, Zanders and Noll (l97l), Lehman 

(1971) , Baker and Brame (1972) etc.) or, equivalently , global rules, 
which enable grammatical processes to 'peek* at earlier and/or later 
stages of derivations (as proposed in Lakoff (1969) ^ (l970b), (1972), 
Ross (1969b), Postal (1970b), etc.). The proponents of coding or 
global devices have often argued that since suoh devices can do what- 
ever extrinsic rxale ordering can do but no.t vice versa, there is 
little point in having both kinds of devices in a grammar, and they 
have consequently proposed that extrinsic ordering be discarded from 
linguistic theory ♦ 

Clearly, both extrinsic rule ordering and coding or global 
apparatus are still a far cry from providing an explanation of the 
empirically discoverable linguistic data. The reason is that the. 
notion •possible transformation* is at present undefined in linguistic 
theory, and while a transformational grammar without global devices 
is too weak in some ways, it is also too strong in other ways; that 
is, as has frequently been pointed out, it is possible, given the 
presently available notation,' to wite a large number of implausible 
transformations with the same ease with which empirically justifiable 
ones can be written. Thus, it is not in the least clear that the way 
in which rules have been formulated is psychologically correct, and 
since the distinction bf^tween intrinsic and extrinsic ordering depends 
precisely on specific formulations of rules, that distinction itself 
is up in the air. Similarly, as there is no presently available 
definition of the notions •possible coding device* or ^possible 
global rule*, the enormous power of these devices allows, in 
principle, the prediction of a still larger class of implausible 
situations »^ 



1 

This undesirable increase in power shows that the elimination 
of extrinsic ordering in favor of global rules is not necessarily n 
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blessing. ^ If extrinsic ordering is retained emd global rules are 
used sparingly, the number of possible graaunars that can account for 
some set of data is reduced (provided, of course, that we can state 
in a principled way which classes of phenomena must be described 
by rule ordering and which by global rules ) . Chomsky has often 
argued (e.g., in Chomsky (l970b)) that given the extreme ease with 
which children learn their languages, a theory which reduces the 
number of possible grammars applicable to some corpus has a better 
chcuice of being psychologicediy correct than a theory which merely 
reduces the number of formal devices available to grammars. 



Most of the effort expended in studying perceptual strategies 
so far has been aimed at providing proof of their existence as well 
as adequate formulations of them. Not too much attention has been 
paid to the order in which strategies must apply, presumably for at 
least the following reasons: (a) the order of application of specific 
strategies is not at present amenable to direct testing, and (b) the 
only kind of ordering regarded as interesting in grammatical 
investigations has been the extrinsic one, and an extrinsic ordering 
of strategies appears quite implausible; the reason is -that extrinsic 
ordering in grammars is aimed at limiting the applicability of 
specific rules, while extrinsic ordering in perception would seem to 
be perversely aimed at limiting the efficiency of specific decoding 
operations. In the remainder of section l*.l, I shall try to show 
that the ordering of perceptual strategies is not without theoretical 
interest, as even non-extrinsic ordering constraint the class of 
possible outputs. Specifically, I will consider three syntactic 
problems which cannot, as far as I can see, be handled with the 
presently available formal devices without loss of generality, and 
will argue that a satisfactory solution can be provided by a consider- 
ation of the order in which perceptual strategies apply to the 
pertinent structures . 

1*. 1.1.1. The first problem I shall consider is Ross^ (1967) 
•output condition, usually referred to as the NP-over-S Constraint: 

(^.1) Internal sentences exhaustively dominated by 
the node NP are unacceptable. 

ik.l) marks as unacceptable sentences like (l*.2a) and (A*. 2b), but 
does not affect sentences like (l*.2c) and (l*.2d). 

(1*.2) a. *It'8 that Bill may return which disturbs 
everybody . 

b. *It^s for Mary to leave that everyone is 

looking forward to. 

c. It*s Billys playing the piano at all hours 

which got the neighbors furious. 

d. It*s Billys loud playing of the piano which 

made his wife ask for divorce. 

True, gerundives like the underscored phrase in (l*.2c) behave like 
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sentences with respect to a number of syntactic phenomena, but Ross 
disposed of this problem by pointing out that constructions like 
the internal ones in U .2a)-(i*.2d) can be placed on a 'sentencehood- 
nominality* scale, and that (^.l) is sensitive only to the two 
highest degrees of sentencehood, rather them to absolute categories. 
As there are many syntactic phenomena which recognize such a 
categorial scale (for details, see Ross (in Press)), ^.his explanation 
can be accepted. 

There are, however, at least two problems with (i*.l). The first 
is that it does not always hold; thus, {h.3) seems to me quite 
tolerable, particularly if it is spoken with a slight pause after 
the underscored constituent (which is probably what most people 
woidd do), even though the internal phrase has the highest degree of 
sentencehood. 



(^.3) The doctor explained that John was sick to the 
many friends and relatives who had assembled 
in the hall. 



Second, it is not clear at what level (l*.l) must hold. Ross 
regarded (U.l) as an output (i.e., surface structure) condition, but 
this is clearly incorrect in view of the acceptability of (^'.l*a) and 
ik.hh). 

ih.k) a. Is where you live known to the police? 
b. Isn't whoever steals money a thief? 

If we accept the analysis proposed in various places in the literature 
that free relatives have lexiceJ. heads at some level of representation 
and that embedded interrogatives also originate with heads roughly 
like the answer to the question, the grammaticality of (i*.lia) and 
(^.^b) is explained if we require that {k.l) hold at the level which 
precedes the deletion of the heads of the internal clauses. However, 
the level in question cannot be a well-defined level of linguistic 
structure. In particular, it cannot be 'shallow structure*, if the 
latter is defined as the output of the cycle, in view of the 
unacceptability of ('*.5a); topicalization is a post-cyclic rule, and 
the head of the free relative must be present in the string at the 
point at which Topicalization applies for the latter to be blocked 
by the CNPC. If shallow .structure is defined as the level which 
precedes all post-cyclic deletions, then (^.l) cannot hold at that 
level either, for the head must be present when certain deletion 
rules subject to the CNPC apply, as shown by the ungrammaticality 
of (i*.5b). 

(?t>) a. *Bill, I have heard ^^^^ lUt''^^''^} ^^^^ ^^^^^ 
b. *This mountain is too massive for me to be able 



to conceive of °^J«°* which") destroyed. 

what J 



Notice that so far I have assumed that ik.l) is stated a;t the latest 
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level possible, lliere is no reason \Axy it could not in principle 
be stated in shallow structure, even if it is applicable later on 
as well; however, notice that Subject-Verb Inversion is a post- 
cyclic rule, and the underscored phrases in {k,k) would not yet be 
internal at the end of the cycle. One might want to state ik.l) at 
the level which precedes post-cyclic deletion, but this creates 
problems of another sort r Kiparsky and Kiparsky (1971 ) argued that 
the complements of factive verbs modify the NP the fact at some level. 
'This claim has been recently called into question by a number of 
writers (l shall return to this matter at the end of (1*. 1.3.1), but 
if we accept it for the time being, we can account for the difference 
in acceptability between {k.6a) and {k.Sh) by assiuning that the fact 
is still present in- the derivation of the latter at the stage at 
which the rule which deletes the coreferent of the rock applies. 
Consequently, the heads of factive complements must be deleted later 
than shallow structure, exactly like the heads of free relatives. 

{k,6) a. This rock is too heavy for me to believe 
that you intend to pick up, 
b.?*This rock is too heavy for me to have 
realized that you intend to pick up. 

However, if we required that {k.l) hold in shallow structure (defined 
as the level \rfiich precedes post-cyclic deletions), we predict that 
there is a difference in acceptability between (l*.7a) and (l*.7b) when 
in fact there is none. 

(1*.7) a. *Was that John was sick believed by the 
captain? 

b. *Was that John was sick realized by the 
captain? 

The import of the preceding discussion is that (a) at least 
with respect to factive complements, (4.1 ) does not hold at a well- 
defined level of linguistic structure, and (b) the heads of factive 
complements must be deleted earlier than those of free relatives—for 
no independent reason—with (U.l) holding, say, in the input to the 
rule which deletes the heads of free relatives. This solution, while 
not impossible, seems to me sufficiently ad hoc to suggest that it 
cannot receive an explanatorily adequate interpretation. 

1*.1.1.2. The second problem to be considered was raised in 
1.2.6, and concerns the fact that certain discontinuities within 
discontinuities, which are intuitiveDjr felt to be instances of the 
same phenomenon, cannot be stated in the same way in the presently 
available grammars. {Ipecifically, It was shown that movements out 
of clauses displaced by Extraposition-from-NP— as in (i^.Sd) — and 
movements out of pseudo-cleft .phrases— as in (U.9b)~or cleft ones — 
as in (l4.9d)— -can only be blocked by extrinsic order^ in the first 



Independent Justification for the ordering of the imbounded 
movement rules before Extraposition-from-NP is supplied by the 
paradigm below: 
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(i) a« You believe the girl who was sick to have 
left. 

t% *Wbich Gii^l do believe who was sick to 
have left? 

(ii) a. You saw a girl who was wearing; a red hat. 

b. Who did you see who was wearing a red hat? 

c. Who who was wearing a red hat did you see? 

As (i) shows, the head of a relative clause cannot be moved by 
Question-Movement. But precisely such a movement seems to have 
derived (iib) from viia) without causing unacceptability . The accept- 
ability of (iib) can be explained if Question-Movement derives (iic) 
from (iia) and Extraposition-from-NP then- derives (iib) from (iic). 
The ordering Extraposition-from-NP — Question-Movement could not 
derive (iib) from (iia), for the former rule is inapplicable to (iia), 
unless we allow vacuous Extraposition-from-NP, a move which has 
generally been deplored by transformational grairanarians. In addition, 
Ross (1967) shows .there are reasons for disallowing the questioning^ 
of a relative clause head after Extraposition-from-NP, as sugf^ested 
by the ungrainroaticality of (iiib), derived from (iiia) with the 
ordering Extraposition-from-NP— Question-Movement; in contrast, (iiic), 
derived from (iiia) with the opposite ordering, is more acceptable. 

(iii) a. Sam didn't pick up those packages which are 
to be mailed tomorrow until nine o* clock. 

b. *Which packages didn't Sam pick up which are 

to be mailed tomorrow intil nine o'clock? 

c. "Which packages didn't Sam pick up until nine 

o'clock which are to be mailed tomorrow? 



and by a global rule like (I.86) in the latter two. 

(14.8) a. A boy who kissed Mary Just walked out. ' 

b. *Who did a boy who kissed Just walk out? 

c. A boy Just walked out who kissed M^ry* 

d. *Who did a boy Just walk out who kissed? 

(14.9) a. I think that what John wants it to kill the 

girl. 

b. *The girl who I think that what John wantA* is 

to kill is pretty. 

c. It's John's courting that girl that I strongly 

condemn . 

d. *The girl who it's John's courting that I • 

strongly condemn is pretty. 

1*.1.1.3. The third problem to be discussed concerns the 
paradigm in (1.90), where the greater acceptability of ( 1.90b) as 
against (l.90a) requires a global rule like (1.91) ♦ 
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(1.90) a. »It's Mary (who) I discussed the claim that 
Bill likes. 

b. ?It's Mary (\Ao) I made the claim that Bill 

likes. 

c. It's Mary (\rtio) I claimed that Bill likes. 

(1*91) Violations of island constraints are considerably 
reduced in severity if the affected island was 
not an island in underlying representation. 

In 1.2.6, I pointed out that (1.91) > rather than extrinsic 
ordering, is needed to account for (1.90), because the configuration 
make the claim is needed in the cycle for cyclical niles like Passive 
and Raising to yield (l*.10a), or for Tough-Movement to yield (l*.10b). 

(U.IO)' a. The claim that Bill likes Mary is believed 
to have been made by Jo. 
b. The claim that John likes Mary is hard for 
me to make. 

What needs to be explained is why the postcyclic wbounded rules 
which move elements out of a complex NP should be sensitive to whether 
the latter is an underlying or derived island. 

U.1.2. The theory of perceptual strategies was descrr^-^d in 
Chapter Two. One point > however, bears repetition: the reax-time 
operations by which sentences are decoded are not, in all likelihood, 
mechanical replicae of the rules of a generative grammar applying in 
reverse order. One argument often givon in support of this position 
is thai the perceptuad and derivational complexities of sentences do 
not always coincide. An even more compelling argument against the 
claim that perception •undoes • derivations is based on the observed 
limitations of echoic memory. Thus, a great deal of work in cognitive 
psychology suggests that auditory memory is of at least two kinds: 
echoic and verbal (the terminology is Neisser's (1967)). Echoic 
memory is assumed to be a short-term store for unsegmented signals 
which have tindergone only global preprocessing operations, while 
verbal memory is regarded as a more permanent store for coded materia?,, 
obtaineC through a synthesis of the echo by the application of 
perceptual strategies. There have be* *^ arious attempts to measure 
the duration of echoic memory; thus, Guvt^man and Julesz (1963) 
estimate the duration of echoic memory as approximately one second 
long, Pollack (1959) puts it at about four seconds, while Ericksen 
and Johnson {^.96k) allow as much as ten seconds. The range of 
variability of the various measurements seems to pose a problem, but 
Neisser (1967, p. 205) points out that the time values obtained in 
the three sets of experiments were inversely proportional to the 
difficulty of the tasks that the subjects wore called upon to perform. 
If we regard apeech perception as a task of intermudlate difficulty, 
a fair estimate of echoic memory would place its useful duration at 
about four seconds. Consequently, a theory of perception which 
claims that transformations are undone anticyclically , and which 
therefore requires that no processing take place until the whole 
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signal has been stored for the highest cycle processes to be activated 
falsely predicts not only that nothing is understood until the end 
of a sentence is heard, but also that only that portion of a complex 
sentence which falls within the final four-sentence snan can be 
understood. For obvious reasons, a Rrammar which incorporates a 
pre- and/or a post-cycle in addition to the cycle is even more 
implausible as a real-time model. 

It seems more plausible to assume that there are units of 
syntactic perception (which, according to Lehiste (in press) are 
determined by suprasegmental structure) to which perceptual strategies 
• are applied in the order in which the units are presented. In 
addition to these 'local' strategies, we must assume 'integrative' 
strategies (which, very remotely resemble the generalized transforma- 
tions of early transformational grammar) which utilize suprasegmental 
cues, 'function words' i discontinuous constituents, etc., in order to 
synthesize the hierarchical relations between the units of syntactic 
perception into a coherent formal object. Thus, the strategies are 
presumed to make use of every cue in order- to emit a fairly narrow 
range of hypotheses concerning what may or must follow, and to strike 
out hypotheses incompatible with later in-coming cues, until the 
desired -reading of the sentence has been zeroed in on (or readings. 
If the sentence is linguistically and/or contextually ambiguous); of 
course, the strategies are heuristic procedures, and, unlike the 
analytic procedures of American structuralism, offer no guarantee of 
success . 

Concerning the order of application of perceptual strategies, - 
there have been some experiments suggesting that some strategies 
apply simultaneously while others apply sequentially. Thus, Mehler 
and Carey (1968) presented subjects with appropriate and inapprooriate 
pictures followed by a sentence containing either progressive or 
participial constructions like. (l».lla) or (l».llh) respectively (it 
was assumed that the latter was syntactically more complex than the 
former ) . 



Ci.ll) a. They are fixing benches. 

b. They are performing monkeys. 

The response times were fastest when the progressive construction 
was used ^d the picture was appropriate, but delayed by the same 
amount otherwise. Thus, it seemed to make no difference whether both 
the syntax and the semantics, or only one of them, were complex, 
which suggests that at least certain aspects of meaning and syntactic 
structure are processed simultaneously. Concerning the evidence for 
sequential application of strategies. Sever (1971 ) reports on a number 
of click-localization experiments which suggest that subjects are 
immediately aware of clausal3 boundaries, but not of boundaries 



^Lehiste (in press) points out that the recognition of clausal 
boundaries is probably itseJf not an immediate process, and occurs 
prior to the recognition of suprasegmental structure. Hhe suggests 
that Itever'g results were due to the fact that he asked his nubjects 
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to wait to the end of the sentences before reporting the location 
of clicks; she repeated the experiment asking subjects to react 
as soon as the click was heard, and got different results. 

It should also be pointed out that Bever also claims that the 
primary units of syntactic perception are deep , not surface , clausal 
boundaries. He bases his claim on the observation that clicks 
occurring in- the middle of . the troops tended to migrate to the 
right in (i) and to the left in (iij; thus, subjects seem to be 
sensitive to deep structure boundaries. 

(i) The general ordered the troops to fight, 
(ii) The general desired the troops to fight. 

Notice, however, that clicks cannot be attracted to deep clausal 
boundaries unless the elements separated by clausal boundary in 
deep structure are still adjacent in surface structure. Thus, it is 
not clear what the click should be attracted to in (iii) in order to 
support Bever *s hypothesis. 

(iii) John definitely seems to be stupid. 

In (iii), the deep boundary is between seem and John. Bever •s 
hypothesis becomes totally implausible if a deep structure allowing 
lexical decomposition is considered, for it woiad imply that if we 
decompose no man came as (NOT CSQME man came3), the boundary between 
NOT and SOME is a primary syntactic percept. 



between non-sentential constituents; however, if subjects are asked 
to wait several seconds before reporting the location of the click, 
non-sentential constituent boundaries do appear to cause response 
bias. This suggests that some structural features are processed 
earlier than others. 

Concerning the strategies which apply sequentially, we must 
recognize at least two kinds of ordering, 0^^ and Ob. 0^ is an exact 
replica of intrinsic grammatical ordering; that is, if some strategy 
A creates the very input to some strategy B, it follows that A must 
apply before B,. As a hypothetical illustration, consider the following 
possible way in which the. processing of {k.l2) may proceed. 

{1*.12) Who do you think John fears that Mary dislikes? 

If the primary units of syntactic perception are suprasegmental 
rather than structural (as proposed by Lehiste), there must be 
strategies which recognize clauses and segment them into predicates 
and arguments. One way in which this mBy be done is through the 
identification of verbs and the retrieval of the syntacto-aemantic 
infonnation associated with them (from some long-term store corres- 
ponding to the lexicon); by recognizing the niamber and kind of 
potential arguments that a given verb requires, it is possible to 
emit tentative hypotheses about which arguments go with which verbs. 
The underscored word in (1*.12) is easily recognizable as a highly 
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probable member of a discontinuity, and as the analysis by synthesis 
of the incoming signal proceeds, the vh-vord is tested as a possible 
argument of the various verbs encountered. Specifically, vho is 
not considered as a possible object' of t hink , as the latter requires 
abstract objects, nor of fear , for although this verb may take 
animate objects, it can take only one object, and is already followed 
by one; therefore, vho can only be an argument of dislikes . .If this 
oversimplified account is in .any way correct, the strategy which 
attempts to^atch who with some verb can only apply after information 
as to the selectional restrictions of that verb has been retrieved. 

0^ depends on the number of cues required for the triggering of 
specific strategies, on the moment in time when those cues become 
available, and on the complexity of the task which the strategies in 
question are called upon to perform. I believe that Oo^ can provide 
an explanation for the problems raised in 1.1.1 and t.1.1.2. Of 
course, there is no reason why specific strategies should not operate 
simultaneously, if the pertinent cues are simultaneously provided,^ 



^It is not clear whether simultaneous application of decoding 
operations involves peureLLlel independent processing or very rapid 
shifting back and forth from one task to another. If we accept 
Neisser's (1967) views that synthesis, as opposed to global pre- 
processing, is strictly sequential, we are forced to entertain only 
the latter hypothesis. 



which is, I suggest, the explanation for the problem raised in U.1.1.3. 

U.1.3. In this section, I propose solutions to the problems 
in U.1.1. 

U. 1.3.1. Before considering a solution to 1.1.1, it is 
important to recognize that (U.l) is probably a perceptual, rather 
than a formal, constraint. The point is that it is difficult to 
identify a constituent which forms a perceptual unit with the flanking 
material when the category to which that constituent belongs is hard 
to identify. Thus, the internal constituents in (U.2a) and (^.2b) 
are arguments of a higher predicate and therefore NP*s; however, 
they are too sentence-like to be readily recognizable as NP*s. On 
the other hand, the internca constituents in {U.2n) anc (^.2d) are 
sufficiently noun-like (they begin with genitives, i.e., determiners) to be 
quickly identified as such. With respect to (^.3), where the under- 
scored constituent is followed by some suprase^ental boundary, we 
are faced with a clause inte^ nal to the containing clause, but not 
to the containing perceptual unit (recall Lehiste^s hypothesis that 
perceptual units are determined by ^suprasegmental structure), and its 
recognition as object of explain is thereby facilitated. It appears 
that the notion •internal NP-over-S^ must be reformulated in terms of 
containing perceptual tuiits , rather than clauses, which is not at 
all siurprising if ih.l) is a perceptual, rather than a purely formal, 
constraint . 

.If (^.1) represents a limitation of the strategy which identifies 
arguments of higher predicates, then the contrast in acceptability 
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between (i*Aa) and (U.Ub) and ('♦.Tb) is rather easy to explain: the 
heads of free relatives or embedded interrogatives sire signaled by 
the cheiracteristic j^-pronoun, and can therefore be reconstructed 
before the clause is eissigned a role in the containing sentence; in 
other words, the- strategy which reconstructs the heads of wh- clauses 
can operate before the strategy which segments out the arguments of 
a higher predicate, in which case the latter is no longer blocked 
by (U.l), for the internal clause is no longer exhaustively dominated 
by NP. In contrast, a mere inspection of the underscored constituent 
in (U.Tb) will not tell us whether this constituent does or doesn't 
have a deleted head; it will be necessary to wedt for the higher 
predicate, and even then the reconstruction of t he fact cannot employ 
overt clues, like the wh feature, but must base itself on the meaning 
of realize , from which it raa^ be inferred that the truth of the under- 
scored constituent is presupposed, and that it consequently modifies 
a head the fact * Thus, the availability of cues as well as the 
relative complexity of the task that str^ategies have to perform is 
consistent with the ordering in (U,13), which provides a straight- 
forward explanation of the contrast between (u.U) and (U.7b). 

(U.13) (i) Reconstruction of the heads of wh-clauses ; 
(ii) Role assignment to constituents within the 

containing sentence (constrained by (U,l)); 
(iii) Reconstruction of the heads of f active 
complements . 

(U.13) is a little too strong in eisserting that step (iii) 
does take place; in fact, the data we have considered allow only the 
weaker statement that step (iii) ( if it takes place at -all ) lAust 
follow both steps (i) and (ii). This weaker statement is in order 
since there is no evidence that the fact is reconstructed at all in 
the decoding of (^.Tb). 

In R. Lakoff (1972 K it is argued that there exist certain 
difficxilties with the Kiparskys* analysis of factives, and it is 
suggested that their behavior might be more adequately accounted for 
by invoking their semantic properties (in a way unknown at present) 
than by postulating j^head NP the fact . Notice, however, that even 
if Lakoff •s arguments €u:e accepted 'and the Kiparskys* analysis is 
rejected, it does not follow, from this that hearers necessarily do 
not reconstruct a head for the complement of realized in interpreting 
(U.7b). To see this, consider embedded interrogative clauses like 
I^ know who left , which have sometimes been analyzed as I know the 
answer to the question: who left . The status of the head the answer 
to the (question is at leMt as questionable syntactically as that of ' 
the head th e fact which the Kiparskys propose. For example, adverbs 
modifying interrogative clauses c€m sometimes be prepooed beyond the 
matrix verb, as in in case you have a minute^ I wonder whether I 
could tcLlk to you (cf . * in case you have a minute » I wonder about the 
answer to the question whether I could talk lo you )'; also, some 
interrogatives are not islands for the purposec^of unbounded rules, 
as shoxm hy it^s Mary I vender why Bill left (cf . * it^s Mary I wonder 
about the answer to the question why Bill left )^sNow, the fact that 



interrogative s do not seem to modify a Nucleus in some cases does 
not mean that such a Nucleus is never reconstructed, for is vhy he 
left a real mystery? or it*s vho vent vhere that I am puzzled by 
are just as frood as i s vhat you found on the table the tissue ? 
Consequently, it appears. that hearers do reconstruct heads for 
interrogatives, although the syntactic justification for such heads 
is just as shaky as that for the heads of f actives. 

If one wanted to deny that internal interrop,atives are 
reconstructed with heads, one would have to deny that the exhaustive 
domination of the. node S by the node NP has anyfhin^^ to do with the 
unacceptability of (l4.2a) or ik.2h). While such a step is possible 
in principle, I propose that it not be taken until a viable alterna- 
tive is proposed. 

I4.I.3.2. We now turn to problem 14.1.1.2, which consisted of 
the difficulty of stating the ungrammaticality of (l4.8d) and (l^.pb) 
as the same phenomenon. Due to specific properties of the" formal 
descriptive model, the lower level discontinuity has to be created 
before the higher level one in (l4.8d), while the opposite ordering 
obtained in (l^.pb). The two cases will, however, be reduced to the ■ 
same case if it can be shown that when the integration of the two 
discontinuities is attempted in real time, the existence of the higher 
level discontinuity is recognized before the integration of the lower 
level one is carried out. That is, when the interrogative or 
relative pronoun is matched with the pseudo-cleft constituent or. the 
extraposed clause respectively, the perceptual device must have been 
already apprised of the fact that these larger constituents have 
been torn off complex NP*a. It seems to me that this condition is 
guaranteed by the order of presentation of the various clues. The 
pseudo-cleft construction is a highly characteristic one, consisting 
of a free or full relative clause, a copula, and a pseudo-cleft 
constituent, in that order; therefore, by the time the pseudo-cleft 
constituent is presented, the perceptual device will have been given 
more thar enough cues to' recognize that this constituent enters a 
discontinuous relation with the subject relative clause^ (sentences 



-^Of course, a sequence of relative clause followed by copula 
need not be continued with a pseudo-cleft constituent, as (i) 
shows . 

(i) What John ate was wonderful. 

However, one of the things that speakers know about pseudo-cleft 
constructions is that the pseudo-cleft constituent, enters into 
selectional restrictions with some verb in the relative clause. As 
there is no *John ate wonderful , a hearer will easily recognize 
that (i) is not a pseudo-cleft construction. 

As pointed out in Bach (1969), there exist ambiguous construc- 
tions like what Descartes discovered was a proof of his exlotenco . 
In such cases, we predict that the ambiguity is resolved when an 
element of the predicate has been chopped; this is confirmed by (il), 
where the a-sentence is ambiguous, but wliere the b-one is not, an 
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the non-pseudo-cleft reading alone is grazmaticflJ.. 

(ii) a. What I saw was a representation of that 
building* 

b. That building, which what I saw W€is a 
representation of, has collapsed. 

like (U.9d) can easily be accounted for in the same way, as the cleft 
construction is also highly cheiracteristic). With respect to extraposed 
clauses as in (l*.8d) or (U.lU), 

{h.lh) Who do you believe Vithe^claim 1 ^^^^ ^^'^'^ 

kissed? c a mj 

they are. recognizable as relatives or complement clauses immediately, 
for they must be introduced by complementizers; indeed, as (U.15) and 
(U.l6) show, the complementizer can be deleted when the clause is in 
construction with its head noun, but not if it is extraposed. 

(U.15) a. A boy you met recently has just arrived, 
b. *A boy has just arrived you met recently. 

{h.l6) a. The rumor John is sick is unconfirmed. 

b. *The rumor is unconfirmed John is sick. 

A clause in sentence-final position can be either an argument of the 
sentence predicate or a dependent of some head noun; by the time the 
extraposed clause is presented, the remainder of the sentence .has 
already been hecurd, and it becomes possible to see whether the 
clause is more appropriately matched with a preceding NP or with 
the predicate. 

If the clause is matched with an earlier NP, any discontinuity 
involving this clause and some external element will be recognized as 
a violation of the CNPC. 

U.1.3.3. Concerning the problem in U.1.1.3, namely, the 
distinction in acceptability between ( 1.90a) and (l.90b) can be 
accounted for quite straightforwardly in terms of the Mehler and 
Carey experiment mentioned in k.1.2. Thus, assuming that speakers 
have an internalized version of the lexicon, in which lexical items 
are characterized phonologically, syntactically and sert^antically (as 
I suggested in the discussion of (i*.12)), the recognition of a lexical 
item should involve the retrieval of its characterizing information, 
its syntactic and semantic properties thus becoming available 
simultaneously. Therefore, -ut the point at which a fiearer attempts 
to integrate the discontinuity in (l.90b), he has already recognized 
that the claim looks like the head of a complex NP, but is not one, 
semantically. Quite likely, make the claim is stored as one lexical 
item, in fact, a sententieil idiom, and it is a general fact about 
idioms that (at least part of) their apparent structure is not 
relevant for interpretation. As we shall see in U.2.5 structureil 
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configurations not relevant to semantic interpretation cannot 
function as islands even when they are represented as islands in 
underlying representation. 

The somewhat more awkward character of (l.90b) as compared with 
(1.90c) may be due to the conflict between what the phrase make the 
cl^im that v.. . seems to be and what it really is . Notice that the 
simultaneity of the processing of syntactic and semantic cues is 
important; for, if the latter were processed si£giif icantly later 
than the former (as one would expect, if perception undid the 
derivations of, say, an Aspects- model grammar), (1.90b) would very 
probably be perceived as ungrammatical in the initial processing 
stages, and further processing would be discontinued, thus never 
giving the semantics a chance to right the situation (cf. (1.90b), 
which is immediately interpretable , with Bever's the horse raced past 
the barn fell ^ which might not be if raced is interpreted as a 
preterite )". 

^.1.1*. I have tried to show in section k.l that the ordering 
of strategies has significant effects on the acceptability of 
sentences . 

Thus, I argued in 1+. 1.3.3 that the simultaneity (or near- 
simxiltaneity) of syntactic and semantic processing can account for 
the almost complete acceptability of (1.90b). 

In 1.3.1* I attempted to show that the ordering proposed in 
(^.13) can provide an explanation for the contrast in acceptability 
between {k.ko,) and {k.kh) and (ii.7b). That is, in order to account 
for the facts, the heads of the internal clauses must be recognized 
before the clauses are assigned a role in (i*.i*a) and (i*.i^b), but 
after that in (^4.7b). The unacceptability of (^.7b) is particularly 
important, for It shows that (l*.13ii) fails if the head of the internal 
clause is recognized ^too late^. 

In U.1.3.2, I showed that the two sentences at issue, namely 
(i*.8d) and (l*.9b) can receive the same explanation, since in both 
cases the higher level discontinuity is recognized before the lower • 
level one is resolved. However, in order to show that the order in 
which the discontinuities are integrated is significant rather than 
accidental > it would be necessary to find a grammaticgl sentence in 
which the existence of the higher level discontinuity would not be 
recognizable when the flower level one is. resolved. Constructions 
satisfying this requirement may be hard to come by. I have, neverthe- 
less, been able to construct an approximately adequate paradigm in 
Japanese, which fortunately has both leftward and rightward 
unbounded processes capable of violating the CNPC; such processes 
are Topicalization and Helativization respectively. In Hoss (I967), 
it is pointed out that complement clauses modifying ^lexical • head 
nouns like syutyoo (the claim) are impervious to Helativization, 
while clauses modifying the dummy koto (thing) are not. As the head 
nouns of such constructions invariably appear on the right of the 
modifying clauses, one may hope that Topicalization would be free to 
move elements out of complement clauses, since at the stage at which 
the discontinuity is integrated, there would be no way of knowinr 
whether the head of the complement clause is a lexical element or a 
dummy. In other words, there should be no more difference in 
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acceptability between (U.lTc) and (U.lTd) tnan there is between 
^♦•7aJ and (U.7b), while there should be a difference between 
(U.lTb) and (U.lTd)* 

(1*.17) a. otto ga kabutte ita koto e watakusi ga 
Otto wearing was thing I 

sinzita boosi wa akai* 

believed hat red 
'The liat vhich I believed that Otto was 

wearing is red. • 

b. (*)otto ga kabutte ita to iu syutyoo o watakusi 

Otto wearing was that say claim" I 
ga sinzita boosi wa akai. 
believed hat- red 
•The hat' which I believed the claim that Otto 
was wearing is red.* 

c. Otto wa, watakusi wa knno boosi .o kabutte 
Otto I this hat wearing 

ita koto o sinzita, 
was thing believed 
*ptto, I believed was wearing this hat.* 

d. (*)otto wa, watakusi wa kono boosi o kabutte 

Otto I this hat wearing 

ita to iu syutyoo o sinzita, 
was that say claim believed 
•otto, I believed the claim vas wearing this 
hat.» 

However, the few Japanese informants I have been able to consult have 
assured me that (U.l7c) and (i+.lTd) differ in acceptability, while 
{U,17b) and (U.17d) do not A While (U.lTd) does not offer a perfect'" 

^The star in parentheses in (U.lTb) and {h.m) indicates that 
the corresponding sentences are unacceptable in some dialects, but 
not in all dialects. 



test case, since the head and the complement clause are adjacent 
(i.e., there is no higher level discontinuity), its unacceptability 
is nevertheless sufficiently significant to suggest that the fact 
that (4.8d) and {k.9h) could receive the same explanation was something 
in the nature of a lucky accidents 

The lack of a distinccion in acceptability does not, however, run 
counter to the explanation I offered in U.1.3.2. Thus, the fact 
that we find no distinction in acceptability between (U.7a) and (U.7b), 
but we do find one between (U.17r;) and (U.17d), is due to the fact 
that there is a distinction between the NP-over-S Constraint and the 
CNPC, the former being a perceptual , and the latter a grammatical 
principle (in fact, a grammatized one, as I will argue in k.3). ^I^ut 
differently, the latter, but not the former, is a part of the 
competence of speakers, and while we would expect a perceptual 



- 168 - 



difficulty to arise only when the string to be processed is complex 
in the appropriate way, there is no reason why such an expectation ^ 
should be entertained with respect to a grammatical (or gramnatized) 
constraint. Specifically, even though a hearer presented with (J4,17d\ 
would in all probability not know, at the crucial moment, that the 
complement clause with which the tppicalized constituent is integrated 
is part of a complex NP, he would still know that violations of the 
CNPC are unacceptable and would therefore strike out the possibility 
that the clause in question might modify a lexical head noun. The 
discovery that syutyoo , rather than koto , is the head of the complement 
clause conflicts with the initial analysis of that clause, and 
unacceptability arises at that stage. 

It is worthwhile to pause for a while and consider how the above 
account, if correct, fits within the perceptual framework we have been 
using 30 far. 

Clearly, the difficulty arising from the categorization of the 
complement clause modifying syutyoo in (U,17d) as both a Satellite 
and a non-Satellite is a typical instance of perceptiial conflict. 
In addition, it is interesting to note that the initial discarding 
of the possibility that the complement clause be a Satellite bears 
certain, similarities to erroneous closure; thus, in both situations, 
an a priori theoretically possible analysis of some substring is ruled 
out 'too soon* . An even more interesting case which may be regarded 
as involving 'erroneous partial c3)sure' will' be discussed in U.2, 
in relation to the relative complexity of low predictability. In 
essence, it will be argued that, the lesser complexity of predictable 
structures leads to the hypothesis that they are most likely to 
occur than non-predictable ones; consequently ^ the possibility of 
finding some desired constituent in an unlikely place is 'partially 
ruled out', as it were, and this, it is claimed, is partially 
responsible for the SSC. 

To return to the problem discussed before, an English paradigm 
which demonstrates the same point as (U,.17),is perhaps (U.l8); (l*.l8) 
is not an ideal test case either, since informants are far from 
unanimous in accepting its acceptability ratings. 

('*.19) a. John referred to the roof of the house in his 
discussion with Mary. 

b. I've Just seen two birds on the roof of the 

house . 

c. It's the house which John referred to the roof 

of in his discussion with Mary. 

d. *?It's the house which I've Just seen two 

birds .on the roof of. 

e. The roof of the house, John referred to in his 

discussion with Mary. 

f. The roof of the house, I've Just seen two birds 

on. 

?It's the house which the roof of, John referred 
to in his discussion with Mary, 
h. *It's the house which the roof of, I've Just 
seen two birds on. 
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The crucial constituent is the roof of. the house > which forms an 
adverbieuL Satellite in b, d, f , h, but not in a , c, e, g. The c 
and d pair shows that the maiming of that, constituent is considerably 
worse in the adverbial case; the e and f pair shows that the constituent 
in question cm be topicalizod in either case; the g and h pair, in 
which the constituent is both topicalized and maimed, shows that the 
overall recult is considerably worse when the constituent at issue 
originated as an adverbial. 

Clearly, at the stage at which the topicalized element is 
encountered in the g or h sentence, there is no way of telling 
whether it is a Satellite or not. But the h sentence is much worse 
than the g one, and this fact again suggests that language users call 
upon their knowledge of the Subservient Satellite Constraint to mark 
the topicalized constituent as a non-Satellite. As in the Japanese 
case, thisrinitial analysis is incompatible with the remainder of the 
string in (l*.l8h), and the sentence is therefore unacceptable. 

1*.2. This section is essentially devoted' to the mitigating 
effect of predictability (and the aggravating effect of unpredict- 
ability) on the complexity of interrupted behavior. 

I begin by reconsidering one parameter which Bever regarded as 
significantly determining the complexity of discontinuities, namely, 
structural complexity (see (2.8l)), and which, to the best of my 
knowledge, has never been' satisfactorily defined. Ross (1967) discusses 
the notion of 'cpmplex NP' in relation to the acceptability of 
sentences which have imdergone Particle-Movement , -and proposes that 
*an NP is complex if it dominates the node S*. On this basis, he 
accounts for the greater unacceptability of (l*.19c) as compared with 
CI*. 19a) or (l*.19b). 

(1*.19) a. John called the unusually pretty girl up. 

b. John called the girl from San Francisco up. 

c. ?John called the girl who was from Can 

Francisco up. 

Ross also points out that structural complexity is probably not a 
simple function of length, as he finds (l*.20a) more acceptable than 
(l*.20b) . - . 

(1*.20) a. I called almost all of the men from Boston up. 
b. I called the man you met up. 

It appears therefore that, with respect to Ross* dialect (for not all 
speakers share his Judgments about (1*.20)), structural complexity 
depends, to some extent, on the occurrence of the node S between 
discontinuous components > However, it appears that even though most 
speakers find. (l|.19a) better than (l*,19c), some speakers find (l*.20b) 
as acceptable, or even more acceptable, than (^♦20a). This suggests 
that the length, as well as the number of nodes (i.e., constituents 
and subconstit'uents) , of the intervening material plays some role in 
determining its structural complexity, and the dialectal differences 
rfre presumably to be ascribed to the different • weights * given to 
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each contributing factor in specific idiolects. The situation is 
in fact even more complicated, for certain speakers find (l*.19b) 
intermediate in acceptability between (U.19a) and (l*49c), even 
though the a and the b-sentences have roughly the same length and 
number of intervening nodes. Several explanations for this 
distinction could probably be found, one of which could be the 
higher degree of (perceptual) recoverability of the pruned S-node 
over from San Francisco as compared with the pruned S-no.de over 
unusually pretty . 

Clearly, a great deal more research must be undertaken before 
a satisfactory explication of the notion of structural complexity 
is arrived at. In the remainder of section U.2, I will consider 
severed cases whiqh suggest that a principle like (U.21) must be 
incorporated into any effort to further explicate structural complexity 

(U.21) The structural complexity of a string is 

inversely proportional to its structural 
predictability. 

1*.2.1.' Contract (l*.22a) and (l*.22b). 

(1*.22) a. John called the not very well liked but 

quite pretty girl on the next block where 
Jack had lived for six years up. 
b. Who do you think Mary told her friends that 
she saw at the party? 

Most speakers find (l*.22b) considerably more acceptable than (U.22a). 
If one of the main factors which make discontinuities complex is 
the number of intervening sentence nodes , then it is not clear why 
sentences like (i+.22b) should be better than (l*.22a), since there are 
more S-nodes between jrtio and at the party in the surface stihicture of 
(U.22b) than between called and u£ in the surface structv.re of 
(l*.22a). I believe the reason is that the structure cf the intervening; 
material in (l*.22b) can be essentially described with a simple 
recursive procedure, while the intervening material in (U.22a) is 
not amenable to such treatment. In other words, the intervening 
material in (l*.22b) can be generited by recursively applyin??; the set 
of rxiles in (U.23), while many more rules need to be specified for 
the generation of the intervening material in (l*.22a)* 

{k.23) (i) S NP VP 
(ii) .VP V NP 
(iii) JiP S 

Thus, to say that the rtinent material in (l*.22b) is more predictable 
is to say that there are fewer unrelated structural relations between 
its subconstituents which a language perceiver must discover while 
holding the first member of the discontinuity in memory. Consequently, 
the perceiver needs to carry out less processing on that material, 
and the burden on his immediate memory is alleviated. 

But, one may ask at this point, how does the language perceiver 
know in advance, at any point, that the incoming material will not 
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necessitate further processing, and that it can be taken care of » 
by and large, by reapplying an already discovered procedure some 
number of times? The answer is that a language user, who has had 
a certain amount of experience with language, has already extracted 
the generalization that some structures are recursively predictable 
while others are not. Thus, the language user knows (by induction 
over experience, or with the help of innate mechanisms, or both) that 
structures like (l*.22b) where the resolution of the discontinuity is 
to be sought somewhere ♦down the chain of command* by a procedure 
roughly like (i*.23) are comparatively non-complex structures, and 
therefore possible .ones . Therefore a hearer will project a recursive 
structure of the appropriate form, and, when scanning the linear 
signal, will experience much less difficulty if the second member 
of the discontinuity occurs where predicted by the recursive 
procedure . 

It is rather obvious that additional processing does affect 
acceptability. Thus, (l*.2l*) or (1*.25), in which the intervening 
material requires npt only the recursive application of (1*.23) but 
also some additional processing , are less acceptable than U.22byr 

('*.2l*) It*s John who I believe Joe, Dick and Mary told 
the two guys who were in love with Jill and 
Donna respectively that Bob hates most. 

(i*.25) Who do you think my girl friend from Chica^?;o 
suggested to the boy who lives across the 
street that Jill loves best? 

Let us consider at this point Ross* HIC (see Chaptei* One) which 
claims that a chopped element must commaJid its place of departure (in 
fact, this constraint applies not only to chopping, but to all the 
situations in which the stronger island constraints apply, i.e., 
feature-changing, deletion under obligatory coreferentiality, semantic 
neutralization, etc.). Ross hypothesized that the HIC is a universal 
fact about human languages, and, as far as I know, no counterexamples 
have been discovered. I believe that the HIC can be at least in 
.part explained by (l*.2l), since (as I shall argue in detail in 
section U.k) the situations in which the various island constraints — 
including the HIC~hold, are precisely the situations in which a 
structural discontinuity exists. Notice then that the HIC reduces 
the complexity of discontinuities in at least two v^ys: (i) by 
increasing the predictability of the interdiscontinuous material 
through a recursive procedure, i.e., (1*.23), and (ii) by making it 
easier to predict the location of the second member of the discontinuity, 
as the latter needs to be sought only down the chain of command, 
rather than anywhere in the tree. 

In the remaining subsections of i*.2, I will discuss several 
phenomena involving discontinuities in which the following situation 
obtains: 

('♦.26) Acceptability increases in proportion to the 
predictability of (i) the structure of the 
interdiscontinuous material, and/or 
(ii) the location of the second member of 
the discontinuity. 
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't.2.2. In2.1.3»I proposed the perceptual principle (2.83). 

(2.83) Two interrupted sentences are perceptually 

complex if one of them is at least in part 
internal to the other. 

One of the cases in which (2.83) takes effect concerns the RRC"» 
where there exist two discontinuities, such that the second is 
createe by^the„resolution of the first. The difficulty is to 
resolve both discontinuities properly when the location and/or 
structure of the second member of each discontinuity is not easily 
predictable, as in (1». 27). In note 19 of Chapter Two, I argued that 
sentences like .28a)- aa-e much more acceptable than (1».27) because 
the former does .lot satisfy (2.83), the interrupted sentences being 
all resolved simultaneously and none being therefore internal to any 
of the others. 



(1».27) 



(1».28) 



??Vhat I will tell the jury that John thinks 
about tomorrow is his luscious mistress. 



a. 



b. 



Mary cooked, John found, and Bill ate, the 
cake. 

John bought, Mary cooked, and Bill drank, 
a bicycle, a cake, and a bottle of 
champaign respectively. 



The greater acceptability of (l».28a) with respect to (l».28b) can 
be additionally illuminated by (k.26). Indeed, in (l».28a), as soon 
OS the hearer has realized that Mary cooked is a maimed sentence, 
and that what follows is coordinated to it, he will be able to 
predict that the following coordinate terms will all be maimed in 
'parallel fashion' (in the sense of note ll», Chapter Three), whatever 
their number. Also, as claimed by (l».27i), the number of interrupted 
sentences will not seriously increase the complexity of the inter- 
discontinuous material, since the latter is essentially predictable 
by the recui-sive application of the rule X -»■ X ( "CS^J ) X. 
• With respect to the requirement made by (l».26ii), the knowledge 
language users have about the scope and nature of Coordination 
Reduction makes it possible .to predict that the resolution of the 
discontinuities will follow immediately upon the presentation of the 
last maimed coordinate term. 

The most interesting case for our purposes is (l».28b), since 
it violates (2.83) ♦ the interrupted sequences being separately 
continued, and nonetheless it is much more acceptable than (1».27). 
Notice that in (l».28b) the resolution of the various discontinuities 
is Just as predictable as in (l».28a). indeed, a sentence like (l4.28a) 
arises when the elements moved by Coordination-Reduction are nil 
identical, and a sentence like (l».28b) arises when the moved elements 
are not all identical; in the latter case, we canppredict that: (a) 
the rt olution of the discontinuities will be a constituent of the 
same kind as the constituent formed by the Interrupted sequences , 
namely, a coordination; (b) the two coordinations will have the name 
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number of terms; (c) the discontinuities will be resolved in the 
order in which they were presented, i.e,, the nth term of the first 
coordination will-be resolved by the nth term of the second 
coordination. 

Thus, we can see that considerations of predictability can 
improve the acceptability of sequences which one wo\ild otherwise 
expect to be unacceptable (in the case of {U.2&b). in view of ' 
(2.83)). 

The third phenomenon I shall consider with respect to 
predictability is the SSC (Sentential Shbject Constraint). Ross 
(1967, section U.k.2) points out that the unacceptability of sentences 
in which the SSC has been violated is due, to a large extent, to the 
fact that the NP-over-S Constraint is also necessarily violated by 
the movement of the chopped constituent to the left of the headless 
sentential subject (he gives credit to G. Lakoff for this observation) 
However, Ross also points out that the SSC is still necessary because 
violations of the SSC, added to violations of the NP-over-S Constraint 
create slightly greater unacceptability than violations of the latter 
alone. I reproduce two of the examples with which he illustrates his 
claim below; in (l*.29b) and (l*.3Ga), the NP-over-P Constraint is 
violated, in (i*.29c) and (i*.30b), both the NP-over-S Constraint and 
the SSC are violated. 

(^.29) a. That I brought this hat seemed strange to 
the nurse, 

b*??The niirse who that I brought this hat seemed 
strange to was as dumb as a post. 

c. *The hat which that I brought seemed strange 
to the nurse was a fedora. 

(^.30) a*"??I deny that that McXntyre has some money is 
certain. 

b. *I deny that that Mclntyre has any money is 
certain. 

A second reason mentioned by Ross for believing that combined 
violations of the SSC and the NP-over-S Constraint are worse than 
violations of the latter alone has to do with the Nominality-Sentence- 
hood scale (see Appendix Two). Thus, the NP-over-S Constraint is 
applicable when the internal constituent is a that- or for-tc clause, 
but not a gerund iai, nominalization, or anything even more nominal;, 
the SSC, on tho other hand, is applicable when the maimed subject is 
^ that-clause. a forrto clause, or a gerundial. The contrast in 
applicability to gerundials of the two constraints is shown in (i*.31a) 
and (^.31b) respectively. 

(^.31) a. Is playing that piano all right with you? 

*That piano, which John's playing is offensive 
to me, is a nteinway grand. 

Thus, it seems that the applicability of the r>nc on top of the J»P^ver- 
S Constraint extends the domain of the latter one further ntep towardn 
the nominality pole. 
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The reduction in acceptability ;^ich ('♦.29)-(l*.3l) i -licate 
sho\ild be associated with the SSC, is, I suggest, due to the fact 
that in structures in which the SSC has been violated, a simple 
recursive search procedure for the second member of the discontinuity 
becomes inapplicable; the result is greater psychological complexity, 
as predicted by (l*.26ii). Indeed, notice that the search .for the 
slot^of a constituent which has been chopped out of an object clause 
can 'proceed as follows: (a) search for the verb of the first clause 
which occurs in the inter-discontinuous material (this has to be 
done in any case, since the segmentation of sentences is probably 
carried out, to a largis extent, by comparing the number of expected 
arguments of the verb with what is actually present in the string); 
(b) move to the right of the verb, and see whether the chopped 
element can Jbe an argument of that verb; (c) if the answer to (b) is 
no, the verb has a clausal object; reapply step (a) to this clause, 
and continue until (b) succeeds. The procedure outlined above 
moves to the right at every step; however, if the chopped element 
belongs in a. subject , the procedure must look outside the recursively 
predictable areas. This step causes some perceptual complexity. 

An independent piece of evidence which supports my account of 
the SSC is .provided by the maimability of topicalized constituents, 
which exhibit '^he seme paradigm as subjects with respect to the* 
nominality scale, as can be seen by comparing (1*,32) and (1*.33). 

{h.32) a. *It*s John who that you killed is clear. 

b. *It's John who for you to meet may be dangerous. 

c. *It*s John who meeting may prove detngerous. 

d. It's John who the shooting of has become 

imperative. 

e. It's John who a picture of is in my pocket. 

(^♦.33) a. *It's John who that you killed yesterday the 
FBI finds incredible. 

b. *It's John who for you to meet at noon the 

FBI finds r^sirable. 

c. *It's John who meeting at noon the FBI regards 

as dangerous. 

d. (?)lt's John who the shooting of in cold blood 

the FBI has been too ready to condone. 

e. (?)lt's John who a picture of taken on D-day 

tlie FBI has u?en trying to obtain. 

Not all informants tolerate topicalization in embedded clauses , but 
those who do feel that the d and e sentences are considerably better 
than the remaining ones in (1*.3?). Thus, for the speakers who 
agree with the judgments in (1*.32) and (1*.33), an explanation is 
available on the grounds that the recursively predictive procedure I 
proposed above is violated in the same way by the maiming of subjects 
or of topicalized constituents. Notice- that these two phenomena are 
not statable as on^^ constraint in Ross* framework, since some mark 
over topicalized constituents (such as ADV, or TOP) is probably 
.needed in order to distinguish formally between topicalized elements 
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and subjects; consequently, the explication of (U.33) along Ross* 
lines would, in all probability, require an ad hoc constraint in 
addition to the SSC. 

An interesting prediction that my accouul of the SSC makes is 
that the SSC is operative only in surface SVO languages, as. it is 
only in such languages that arguiments may be positioned on both sides 
of a verb. The languages in which Ross pointed out that the SHC 
holds are indeed SVO .ones, while the only language he mentions in 
which the SSC does not hold is Japanese, an SOV language. The failtire 
of the SSC to hold in Japanese is illustrated in 

(U.Sl*) John-ga suki ta to.iu koto ga.-hakkiri shite iru 
John likes obvious 
so no uchi wa kiree da. 
house pretty. 
*'The house which that John likes is obvious is 
prett:" . ' 

Of course, no hypothesis can be regarded as confirmed on the 
basis of one languc^je, but I have found two more languages in which 
the S?C fails to hold, and they are both SOV. 

Tnus, Hankamer (1971) cites the following example from Turkish: 

Mehm^din yiyecegi gflpheli olan meyva. 
Mehmet^s going to eat doubtful being fruit 
*'-The fruit which that Mehmet will eat is doubtful 

The well-formedness of (3.35) is significant, for, as Hankamer points 
out, the Turkish rule of Relativization — which .always operates by 
deletion, like the Japanese one — is subject to the CNPC. 

The second SOV language which disobeys the SSC is Hindi . Hindi 
Relativization operates by movement toward the relative clause head, 
which usually precedes the clause. 7 



7 

In fact Hindi- supports the claim made in Drachman (1970) that 
Relativization proceeds by copying and deletion. Thus, a pro-form 
may optionally be left behind, as (i) tshows, 

(i) vo admi Jiske bareme betaiahe ki C ^ bimarke 
the man whom about ftold you the (he) sick 
^acukahe. 
has left 

•The man about whom I told you is sick has left.' 

In addition, the relative clause and its head may be permuted, in 
which case, a copy must appear in the place of departure of the 
relative pronomi, as (ii) shows, 

(ii) Jiske bareme betaiahe ki ^ ^i^^^h^ vo 

whom rabout I told you that (he) sick is the 
admi ^acukahe. 
man has left. 
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The facts in (i) and (ii) require the ordering in (iii): 

(iii) a. Relative pronoun copying; 
^ b. Relative clause preposing (or, perhaps • relative 

head postposing); 
c. Copy deletion. 

Independently of ordering considerations, the facts in (i) and (ii) 
above are interesting, as they suggest that surface structure cannot 
be distorted without limit; thus, if the optional (iiib) applies, 
(liic) is blocked, for the former creat^^s-decoding problems by 
causing the relative pronoun and tne relative clause head to be 
non-adjacent , and the further perce^^tual difficulty which the 
deletion of the copy brings about can no longer be tolerated. 

Hindi Relativization is subject to the CSC, as shown in {h.36) 
However, as shown by (U.37), ",his rule is not subject to the SSC, 

(U.36) »me larka deikta hfl yiske bareme tumhe betaiahe 
I boy 'see whom about I told you 

ki John ketaak? nehi posand kerta aor Mary 
that John books dislikes and Mary 

posand karti he 
likes 

*•! see the boy about Mhom I told you that John 
likes books and Mary dislikes. • 

(^ 7) vo larka jise Mary cumti he tik he vo biire^- he. 
the boy who Mary kisses good is he sick is 
*»The boy who that !4ary kisses is a good thing 
is sick.' 

k.2.h. The fourth case I wish to discuss concerns the 
possibility of deriving (l*.38b) and (k^Be,) (see Ross (1967; section 
• 3 y j • 

f (^.38) a. The government presdribesthe height of the 

lettering on the covers of the reports, 
b. Reports which the government prescribes the 
heig^it of the lettering on the covers of 
are invariably boring. 

As I pointed out in section 3.2.1*, sentences like (l*.38b) 
constitute an apparent violation of the Subservient Satellite 
Constraint, as the moved element is not only a Satellite, but also 
an element of some other Satellite(s) . By using (1*.26), wrSSy 
expect to explain the possibility of violating the Subservient 
Satellite Constraint in this case, since the inter discontinuous 
material is indeed predictable by a simple recursive procedure 
roughly like {h.39)* 
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(U.39) NP NP NP^ Cwhere the subscript on the second 
NP on the right of the arrow is an ad hoc 
abbreviation to which no theoretical si^nifi- 
cancG should be attached indicating that thiit NP 
is a dependent of the preceding oneD. 

This procedure is exceedingly simple, since it predicts a chain of 
i<P s, each of which is a dependent of the NP immediately- to its eft 

We may also hope that (I*. 26) will explain the difference in 
acceptability between pairs like thkt in {k.ko) (which have already 
been brought up in discussions o'f adverbials). 

(h.Uo) a. The car which a reporter mentioned the scene 
of the cr«^h of was a new Mustang, 
b. »The car which a reporter found Mary at the 
scene of the crash of was a new Mustang. 

Indeed, in both ik.ko&) and (U.kOh) ve find a chain of HP's in 
which every non-extreme NP is a Satellite of the NP to its left -and' 
a Nucleus of the NP to its right; but onl> in (l*.l*Ob) this whole 
chain of HP's is in turn a Satellite of a sentence, namely, a repo rter 

found Mary. Thus, we may claim -that even though both sentenciT"^ 

exhibit a right-branching chain in which each pair of adjacent 
constituents exhibits the Nucleus-Satellite relation, it in only in 
}, * Nucleus-Satellite pair is of the same kind : in 
{'♦.'♦OoJ, the highest pair has the structure C S ADVD, while the 
lower ones have the* structure CNP NP3. Thus, it is onJjr in (k.UO&) 
that the chain of N & S's can be" enumerated with one recursive 
procedure. 

We have so far established that (i».38b) differs from other 
instances, in which the violation of the Subservient Satellite 
Constraint results in unaccept ability, in that the maimed N & S 
IS here recursively enumerable by one procedure; we have also 
established that the difference in acceptability between (l*.l*Oa) and 
[U kOh) correlates with the recursive predictability of the former, 
but not of the latter, with one procedure. However, in order to 
exglain these facts, it is not sufficient to recognize that greater 
predictability requires less processing, and therefore less strain 
on immediate memory; some of the reasons are: 

(a) Neither in (l*.l*Oa) nor in (l*.l*Ob) does the recursively 
predictable material constitute the whole of the interdiscontinuous 
material, and the processing of the latter requires more than one 
procedure in both cases. It seems that the crucial difference 
between the tw,-> sentences lies in whether the chain of N & S's, 
rather than the whole of the interdiscontirtuous material , is recur- 
sively predictable with one procedure. 

(b) As we shall see in k.k^ the unacceptability of structures 
which violate the Subservient Satellite Constraint cannot be (entirely) 
attributed to the structural complexity of the interdiscontinuous 
material and considerations of memory-load. To see thip, we need 

only consider the difference in acceptability between (l*.l*la) and 
(^♦.'♦Ib). 
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i^.hl) a. What did I tell you Bill likes? 

b. »What did I tell you Bill likes and Mary 
hates potatoes? 

«nH^Ji;^^''l (J*.l*lb), the interdiscontinuous material is identical' 
and therefore equally complex to process, but only the latter ' 
exhxbxts a violation of the Subservient Satellite^ConstJi"" 

Similarly, m ih.k2), the only element occui-ring in the inter- 
^scontinuous material which creates unacceptability'is the head 5p 

ii*.U2) Who did you hear (»the claim) that John loves? 

componLS'^ Sus T'' t''^' ^^tveen the discontinuous 
components Thus, in Japanese, the head of a complex NP follows +he 

clause by Topicalization is out, as shown by (4.l7d, e) and (li.l7f, g). 

(1*.17) d.(»)Otto wa, watakusi wa kono boosi o kabutte 
°**o I this hat wearing 

ita to iu syutyoo o sinzita. 
was that say cleiim believed, 
••otto, I believed th>i claim was wearing this 
hat. • 

e. Watakusi wa Otto ga kono boosi o kabutte ita 

I Otto this hat wearing was 

to iu syutyoo o sinzita. 
that say claim believed. 
•I believed the claim that Otto was wear^nr 
this hat, 

f . Kore was Mary ga kabutte ita koto ga akiraka 

This Mary wearing was thing obvious 
no boosi da. 
is hat - is 

•This is the hat which it is obvioun that Mary 
was wearing. • 

g. "Mary wa, kore wa kabutte ita koto ga akirawa 

Mary this wearing was thing obvious 
na boosi da. 
is hat is 

•''^^^'this is the hat which it is obvious was 



Notice also that in (1.36c), the aaverblal reading, of I sumose 
Uoris'SoU S constraint, but LrSS 

l s:''?n"ao1f -i1i°lt^^:,g the^^nteraiscontmuou. «.terlal (there 

(1.36) c. It^s me who supposes that John is sick. 

to a\8r-nf °^ predictability uncovered in relation 

to (1*.38) and (J*.4o), even though probably relevant to the data at 
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issue, cannot provide-on explanation based solely on considerations 
of memory-load, and thajt something like the following sup:^3:ests itself: 

(U.U3) A recursive procedure does not generate 

significant new relations beyond those that 
result from one of its applications. 

What (U.U3)~whose statement should xindoubtedly be further 
constrained — purports to say is intuitively clear: (U.39) generates 
a second NP which bears tjie^relatlqn "nominal complement of" to the 
first NP, and no matter how many times the procedure applies, the 
lower, nominal complements will still be perceived as n gminal ^oy^p;^e- 

only, not as parts of " higher" npmipap. com p lements: in some 
sense, the application of the procedure (U.39) cemnot generate new 
relations Just because it has applied more them once ^ This is what 
we would expect if the structural hypothesis about chains like the 
covers of the reports and the height of the lettering on the covers 
of the reports is made essentially in the ' same way, i.e., by one 
procedure in both cases. The gist of the matter is that, given (U.U3), 
(^.39) and (U.UOa) are felt as instances of Satellite-choppinfr, but 
not of Satellite maiming. On the other hand, the moved constituent 
in (U.UOa) is both a Satellite of the type 'nominal complement of 
NP* and part of a Satellite of the type 'optional adverbial'. There- 
fore, (U.UOb), but not (^.UOa), constitutes a violation of the 
Subservient Satellite Constraint, because the right-branching chain 
of N & S's is not recursively enumerable with a single procedure, as 
required by (U.U3). 

We call now reconsider the discussion in ^^.2.1, in the light of 
(^.^3). (U.22b) suggests that an object complement clause can be 
maimed, regardless of its -'depth*. In other words, a complement 
clause 'feels the same' regardless of whether it was generated by 
one application of (U.23) or more than one application, so that both 
(U.22b) and (U.UU) are perceived as instances of maiming of complement 
clauses and not of instances of maiming of different kinds of clauses . 

{U.UU) It's John I believe, Jill loves best. 

It is tempting, on the basis of the above facts, to propose a 
•universal principle like (U.U5). 

(^•^5) If a discontinuity can involve a constituent of 
_ type C generated by one application of a 

recursive procedure PR, it can also involve 
a constituent of type C generated by any 
number of uninterrupted applications of PR. 

I do not, at this stage, firmly propose (U.U5) as a universal of 
language, since I do not quite know how to sccount for counterexamples 
like (U.U6). 



a. John is upset Cbecause Mary left him fibecause 
he is crazy]]. 
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{h.h6) b. It* 5 because Mary left him because he is 
crazy that John is upset, 
c. It^s because he is crazy that John is upset 
because Meiry left him. 

(U.Uba) is a chain of right-branching N & S*s generated by a recursive 
procedure (that the procedure is indeed recursive can be seen by 
continuing (U.U6a) as • .. .because he fell on his head recently because 
somebody tr oed him. . . ) . Because- clauses can be clefted<. as shown 
by (U.U6b), w •cleft counterpart* of (U'.U6a). However, (U,U6c) is 
not a cleft counterpart of (^.U6a), as -*;he clefted constituent can 
only be understood as modifying John is upset because Mary left him , 
but not Metry left him . 

It appefiurs that tl:^ lower because- clause in (U.U6a) is felt as 
a part of a Satellite, not just as a Satellite. Obviously (U.U6a) is, 
in some sense, different fl-om the source of (U.22b) or from (U.38a), 
but stating the difference in a satisfactoiy way is no easy task. One 
differentiating factor ?Duld be the fact that the string in (U.Uba), 
unlike the source of {K ?2b) or (U.38a), is potentially ambiguous 
in maihy ways, since any of the because- clauses can be \mderstood as 
modifying not Just the clause immediately to its left, but any of the 
clauses to its left in the chain. Some of these possibilities ar' 
not realized when a clause other than the one immediately to the ripiht 
of the main clause is clefted, since clauses modifying the same clause 
are logically coordinated, and cleft ing one of them is prohibited 
by the CSC. However, in (U.U6c), at least two readings are theoreti- 
cally possible: the clefted constituent logically moaifien either 
John is upset becaus e Mary left him or Mary left him . If it can be 
argued that the former readin/3^ is psychologically more comf^lex than 
the latter (after all, the former is clearly more complex even in 
(U.Uoa), since it requires us to understand that John is upset for a 
very special reason, that is, that Mary left him because he is crazy, 
not just that she left him; this kind of complexity increases consider- 
ably as the chain is lengthened), then we are faced with a 'trans- 
derivational constraint situation' similar to the one discussed in 
2.2.U.1 in connection with Perlmutter's a woman hit a girl who was 
pregnant ; in other words, the resolution of a discontinuity is waived 
when the option of a less complex: interpretation i*- available. 

' In all probability, these facts are only a partial explanation 
of the non-existence of a transformav onal relation between (U.U6a) 
and (if.U6b). In any' event, I am. hopeful that when a satisfactory 
explanation becomes available, it will be possible to uphold (U.U5) 
as a universal of language, the non-derivability of {k.kCc) from a 
structure incorporating (U.it6a) being then attributable to well 
supported independent factors. 

i*.2.5. Before concluding section li.2, which has been essentially 
concerned with the effect of predictability on structural complexity, 
it 5.3 perhaps worthwhile to illustrate the following rather obvlouii 
point: no complexity (in fac**, no structural constraints of any kind) 
should arise when the frame in which a discontinuity Is eml)edded l:\ 
:>tructurally irrelevant. Thus, consider (U.47). 
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(U.UT^ a. Somebody^ met a man who loves someone ^s, wife. 

b. Who met a man who loves someone^ s wife? 

c. *Who met a man who loves whose wife? 

d. Who met a man who loves Bill's wife? " 

We notice that (U.UTb) but not (U.UTc), can be derived from (U.UTa). 
The reason is that in (U.UTc) a feature-changing rule~jrti-placement~ 
violates the Subservient Sat.ellite Constraint. However, (U.UTc) is 
all right as an echo-question, when the interlocutor has said some-- 
thing like (U.UTd) and the speaker of (U.UTc^did not catch the word 
Bill;- or as an incredulity-question, when the speaker finds it hard 
to believe that his interlocutor should' ask that question about 
Billys wife (possibly because he knows that^ she has been dead for 
many years, or that Bill is a bachelor, etc'.^*]. Observe, however, that 
-echo-^ or incredulity-interpretations b'f (V/ll'Tc), the whole structure 
except for whose is structurally irrelevant , as it is only supplied 
as a framework for the hearer to be able- to pinpoint the object of 
the illocutionary force (request to repeat a phrase, or expression of 
surprise at the presuppositions of that phrase respectively). As the 
structural frame is not intended for processing, we are confronted 
with the limiting case of structural non-complexity, namely, 
structural irrelevance, and conseciuently no constraints of the kind 
discussed in this thesis obtain. 

A similar situation arises in the case of the so-called 
!examination-type questions', that is, questions which are not 
genuine requests for information — since the speaker knows the answer, 
and his purpose in asking them is only to test his interlocutor's 
knowledge. Such questions can violate island constraints, as r^hown 
by (U.UTe). 

(U.UT) e. Napoleon was a French general who invaded 
which country . 

The grainmaticality of (U.47e) on the examination-reading is 
explainable in essentially the same way as its grammaticality on the 
eclro- or increduiity-reading. Indeed, everything in (h.kle) except 
for the underscored part is assiimed to be common knowledge to both 
examiner and examinee; I suggest that the^non-under scored part of 
(U.UTe) is in fact part of the answer expected of the examinee 
(which the examiner provides since it is known anyhow), and the 
illocutionary force of the utterance is somethinp^ like 'complete 
the following answer by fitting the correct phrase in place of the 
one marked with a wh-feature' . Again, the non-underscored portion 
fiinctions as a framework, rather than material meant to be processed. 

U.3. This section is devoted to the Subservient .Satellite 
Constraint. For n^asons made clear below, the explanation will be 
based on conflict, rather than on interrupted behavior. 

We have-ialready seen in h.2.k that the unaccept ability found 
in instances of violation of the Subservient Satellite Constraint 
cannot very well be attributed to the structural complexity of the 
interdiscontinuous material, since the material on which unacceptability 
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could be blamed (that is, the Nucleus) is not necessarily part of 
the interdiscontinuous material. Moreover, it is implausible that 
the occurrence of a short^ and structurally simple Nucleus like the 
fact or the claim could place a load on memory sufficient to maJce all 
the difference between acceptability and unacceptability . 

The first objection presented above is comparatively easy to 
take care of. Thus, rather them regard the Nucleus as the 
unpermissible 'intervening material', we shall attribute that status 
to the N & S node , by extending the notion intervene to apply not 
only to the lirieetr, but; also ^to the vertical, dimension (thus, we 
say that, a node A which dominates B but hot C -intervenes between B 
and C). We can see that regcirdless of the relative linear ordering 
of the Nucleus and the Satellite, violations of the Subservient 
Satellite Constraint will always create a discontinuous Satellite 
with an intervening N & 5 node, as the element externeG. to the 
Satellite is by. definition outside the dominance of the N & S node. 

Concerning the second objection, I conclude that the complexity 
of interrupted behavior can indeed not be regarded as essentially 
responsible for the Subservient Satellite Constraint, and that a more 
plausible account can be given in terms of perceptual conflict. 
Specifically, I propose that the following principle is (at least in 
part) responsible for the Subservient Satellite Constraint: 

(ii.liS) A stimulus may not be perceived as simultaneously 
having m mUjaaUL y conflicting values on the 
same classif icatory scale. 

In the Introduction, I pointed out that a generalization like (hMj) 
would constitute a near-truism, and therefore hardly present any 
interest; as can be seen, (U.l*8) is considerably more specific than 
(U.l*9) (thiS'was also true of the conflict principle (2.23)). 

(UJiy) Conflicting percepts are complex. 

Before showing how the Subservient Satellite Constraint follows 
from (J*.U8), I will briefly argue thac such a principle is inde- 
pendently needed for linguistic phenomena. Consider Langacker's 
restriction on the occurrence of pronouns: 

(U.5C) A pronoun may not beeur all relevant primacy 
relations to its antecedent • 

The relevant scale here is one of relative emtocipation . Thus , when 
some notion or referent occurs more than once, one occurrence is 
usually regarded as •primary* while the others are regarded as •mere - 
repetitions*.. The primary occurrence can be identified In at least 
two ways: f)p8itionally and morphologically. The positional relations 
involve- thW two available dimensions: precede , for the horizontal 
dimension, and command , for the vertical one (to conyince ournelvcs 
that the relation command expresses a genuine psychological reality, 
we may think of asymmetrically commanding elements as analogous to 
commanding officers in a military unit; the grairtmatically irrelevant 
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relation •higher up in the tree than' is anadogoiis to the militarily ' 
irrelevant one 'superior officer of). In most situations, both 
positional relations are relevant; in symmetric coordinations, only 
precede is, as the terms are hierarchical equals ot- one another. 
In both situations, when all the positional evidence points in one 
direction, the morphological evidence cannot point in the opposite 
direction, which is precisely what (U.50) says formally. Observe, 
however, that partial. conflict is, if not ideal, at least tolerable. 
Thus, (U.51a), in which the morphological evidence is contradicted 
by the horizontal, but not vertical, evidence, is an acceptable 
sentence, although perhaps less felicitous than (l*.51b), in which no 
conflict obtains . 

(U.51) a. The man who said that he loves her announced 
yesterday that he would marry Julie soon, 
b. The man who said that he loves Julie announced 
yesterday that he would marry her soon. 

In all probability, (1*.50) is a graimnatization of (^.1*8), and 
thus provides internal evidence in its favor. J! do not know of 
specific corroborating experimental evidence (tHe complexity of 
conflicting percepts has, of course, already^been studied extensively). 
In arguing in favor of (1*.52), in fact, a weaker subcase of (i*.U8), 
Bever (1970 ) mentions supporting evidence from visual perception. 

(I4.52) A stimulus may not be perceived as simultaneously 
having two positions on the same classificatory 
dimension. 

\> 

Bever argues that (U.52) is responsible for the complexity of multiple 
center-embedding, of triple negation, of chains of recursively right- 
or im--branching II & S's, etc. I do not believe that is 
solely responsible for the unaccept ability of the constructions Bever 
considers, as those constructions differ sharply in degree of 
acceptability (for some arguments that independent factors are 
involved see 2.1.3). Nevertheless, does seem' to be at least 

partly responsible for the complexity of triple negation, where the 
medial negative is both 'negator^ and ^negatee', or for the (rather 
faint, in my opinion) complexity .of the picture of the owner of, the 
house, where t he owner is a dependent of the picture but the head of 
the house . (I4.52) is undoubtedly a much weaker restriction \han 
(^.^8), since it involves an apparent, rather than a real, conflict; 
indeed, the medial element in the cases Bever considers is emancipated 
with respect to some element and downgraded with .respect to another 
element, the only problem being that the two (or more) pairs of 
elements are Judged with respect to the same relation; in contrast, 
in crses which violated (<*. 50 ) , an element (the pronoun) is both 
maximally emancipated and maximally dowgraded with respect to the 
same element (the antecedent ) Consequently, it is to be expected 



The definition I proposed for symmetric coordinate terms as 
bidirectional N & S^s (when there are' two terms) does not predict 
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that symmetric coordinations are complex, as is not appli- 

cable. Indeed, I did not propose that the terms are simultaneously 
perceived as Nuclei (emancipated) and (Satellites (downgraded) , 
but jather that we can look at symmetric coordinations in as many 
ways as there are terms. Put differently, in the symmetric structure, 
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is Nucleus and X2 is Satellite, but we know that for each such 
structure there is a synonymous one differing structurally from this 
one only in that X2 appears in the position occupied by X^ and vice 
versa. 



that violations of {k.hQ) will be much more severe than violationn of 
(1*.'J2), and that graramatizations of the former will in general occur. 

Having established the psychological reality of (U.U8) in 
language perception, we turn to the relation between (U.U8) and the 
Subservient Satellite Constraint. » 

Hierarchical relations among coBstituents also exhibit a scale 
of relative emancipation , as we have seen coreferents do. In endo- 
centric constructions in general, the head bears a primacy relation 
to its dependents, which we shall call the relation of centrality . 
In N & S*s, the Nucleus is not only central, but also self-sufficient, 
since the Satellite unlike other types of dependents, is not a partner 
in the N 4 S, but an optional addition; we shall say that the Nucleus 
bears to the Satellite the re?ation of sufficiency . These two 
relations transfer quite naturally to the N & S node in relation to 
the Satellite, the latter being non-centrcJ. and non-essential with 
respect to the former. Let nbn-centrality emd non-essentialness be 
status relations > Then in terms of such status relations, a-Satellite 
is maximally downgraded with respect to its N & S node. 

Let us now consider what the psychological import of a discon- 
tinuous Satellite is, when the N i S node intervenes in the sense 
defined above. In a situation of that nature, the structural . 
arrangement is such that the well-formedness of the Satellite is not 
decidable without taking the external element into account; in other 
words, the structural integrity of the Satellite depends on something 
elpe than the N & S node or the material it dominates. Thus, on the 
scale of structural emancipation, the relevant status cues suggest 
that the Satellite is wholly -dependent on the N 4 S node, while the 
structural-integrity cues suggest the opposite; hence, a conflict 
\^ich results in complexity, as (^.U8) predicts. 

Despite superficial differences, the Subservient Satellite 
Constraint is quite similar to Langacker^s constraint on pronominali- 
zation. In both cases, total conflict is unacceptable and pai*tial 
coriflict is not. Thus, analogous to the' acceptability of (l4.51a), we 
rind the endocentric constituent VP, which may consist of a verb and 
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a complement clause (as well as other elements not relevant to our 
discussion); in such a case, the dependent, i.e., the complement 
clause, is non-central with respect to the VP node, but it- is not 
non-essential; consequently, the maiming of the complement clause is 
permissible. 

Notice that the conflict between status and integrity facts 
which arises due to violations of the Subservient Satellite Constraint 
cannot arise when the whole Satellite has been chopped. Indeed, as 
the Satellite does not exist under the domination of the N & S node, 
it is patent that its structural integrity cannot be decided on the 
basis of the material dominate*! by the N & S node, and the opposite 
assumption does not arise. 

Before concluding this section, it shoxild be pointed out again 
that violations of the Subservient Satellite Constraint are not 
always obse:rvable from surface structure; this fact has always been 
recognized, and it led Ross to his claims about rule ordering (Hons 
(1967)), and later scholars to the reformulation of the various 
island constraints as global ones. At least four cases can arise 
in which the surface structure does not contain all the necessary 
information; they are illustrated in (^♦.53)-(^ .56) below. 

(I4.53) *It*s Mary who I believe what you told* 

(U.5^0 *This the girl I saw a boy who likes. 

(h.^j) *It*s Mary who a boy rftiat left who likes. 

(^.56) (?)l am aware of the\fact that you like someone, 
but I don^t know who.\ 

In (U.53), the intervening N & S node is not obviously present in 
surface structure, since the Nucleus has been deleted; in (^.J^)* the 
external element which forms one end of the discontinuity (the 
relative pronoun)- has been deleted; in ('♦.55) » it is not obvious we 
are dealing with on N & S, as the Nucleus and the Satellite are 
non-adjacent; finally, in (i*.56), the whole N & G is absent, the 
only suggestion of a discontinuity at some level being the external 
clement who. In the first three cases, the degree of unacceptability 
is essentially the same as in the corresponding overtly clear cases 
(in fact, (U.55) .^s probably worse, in view of the difficulty • 
Involved in integi'ating a double discontinuity); in the last case, 
unacceptability ii\ considerably reduced for most speakers, and non- 
existent for some speakers. 

Regarding (U.53), 1 argued at length in 1.3.1 that the 
reconstruction of heads on the basis of the double function of 
words like \^;-forms is a highly efficient strategy, and therefore 
the hearer \;ill have, in all probability, reconstructed the N & S 
node at the point at which integration is attempted. 

Concerning {U.^k)^ it is not really true that the external 
element is invisible; instead, the externeJ, element is now the head 
of the relative clause (t he pirl ), rather than the wh^prdnoun which 
could have appeared next to it. The situation is similar to the one 
found with Japanese relative clauses, where the relative pronoun Is 
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always invisible (in k.k^ I shall argue that the distinction between 
Japanese and English is a trivial one). 

With respect to (^.56), the improvement in acceptability due to 
the disappearance of most of the N i S (at^ well as of additional 
material, in certain cases) deserves a separate study, which I have 
not Tindertaken. The first temptation is to say that speakers do not 
necessarily interpret (1*. 56). by . reconstructing the whole of the 
deleted material, but in some ad hoc way, by appending something like 
he is at the end of (^.56) • Another possibility would be to assume 
that even if the deleted material is reconstructed, it need not be 
reconstructed with a deletion site in place of the moved element, 
but rather with a pronominal copy (we recall that copying rules are 
not subject to the Subservient' Satellite Constraint), Chomsky (1970b) 
suggests that 'understood materiatl' as in (1*.56) should be interpreted 
by 'compositional semantics*, rather than' by postulating a deletion. 
This proposal, like the guesses I have made above, is insufficiently 
elaborated on to be amenable to discussion. 

Any serious proposal along the. above lines should explain why 
forbidden operations which »take place inside deleted strings '^ some- 
times cause unaccept abilities and sometimes do not* Thus, \inaccept- 
abilities due to violations of Cross-Over are still felt when the 
evidence that there has been a permutation of cor-ferents in no 
longer present in the stringy as shown by (k.^l). 

. {h.57) *Mary was hit by Bill^, but he^ wasn't Chit 

by himself jD, i 

It might be suggested that there is no way in which compositional 
semantics could reconstruct the bracketed part of (1**57) without 
producing a coreferent of he^, while the missing string in can 
be reconstructed without actually producing a maimed island. But 
until more pertinent data are considered and until these vaprue 
proposals are made precise, there is little point in further 
speculation. 

k.k. In Chanter One, I mentioned Ross* observation that 
certain rules do obey his island-constraints, while others do not. 
His statement, which I called the Dichotomous Behavior Principle (DBP), 
is reproduced below. 

(l.t>2) Chopping rules, feature-changing rules, and 
unidirectional rules of pronominalization 
coey the constraints; bidirectional rules 
of pronominalization and copying rules do 
not . 

I have pointed out in various places in this thesis that the DBP is 
inadequate for at least the following two reasons: 

(1*.58) a. Island constraints cannot be restricted 
to trcmsformatl ms. 
b. Ross' distinction between unidirectional 
and bidi.rection£LL pronominalization is 
both false axid misleading. 
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In relation to Ross» distinction is false, because he 

attributes the unacceptability of (li.59) to the fact that VP.- 
Deletion ( a bidirectional rule) goes into a comT)lex NP (as we have 



Furthermore, Koss* distinction is misleading because it misses 
the point that deletion rules are subject to island-conntraints if 
and only if there is an obligatory identity condition between the 
/deletor^ and the *delete^f> The Japanese rule of Relativization is 
a case in point, and some of the rules of 'unidirectional pronominol- 
ization* Ross mentions constitute illustrations of the same principle. 
The rules illustrated in (l*.6o) are particularly interesting because 
the core ferentiality condition between the underscored elements in 
U.bOaJ and (l^.bOb) is invited, in the sense of Geis'and Zwicky 
(1971 )» rather than real. 



i^^^^ence-inviting factor in constructions like those in 
(U.60) is the fact that the fo r-to clauses are •resultatiye* , in 
the sense that the events or state of affairs expressed in them must 
be a consequence of the state of affairs expressed in the main 
clause. Consider,, for example, (J*.6l). 



Upon beirj presented with sentences like (l|.6la) or (Ji.eib), many 
informants reject them as semantically incoherent. However, when 
some of the unexpressed assumptions are pointed out to them, they 
realize that the sentences in question ore semantically well-fcr'':ed. 
Thus, if it is assumed that. oohn> is bearinip: the rock on his dnouJders 
while trying to lift the table, {h.6l) makes perfect sense; the svate 
of affairs expressed in the for-to clause now follows from that 
assumption in conjunction with the proposition expressed by the n^in 
clause. Similarly, if we imagine a culture in which one is forbidden 
to put on a bathrobe unless a pair of socks h i,gs from the chandelier, 
the for-to clause of (l*.6lb) follows from that piece of pragmatic 
information in conjunction with the main clause, and the whole 
sentence becomes entirely coherent. 

It should be noted that, while the for-to clauses in (ii.6la) or 
(l*.6lb) do not contain a coreferent of the rock or the sock 
respectively, coreferents of these phrases must show up in the 
pragmatic presuppositions of the two sentences, which we may formulate 




I know a boy who is 



{h.Co) a. This rock is too heavy for me to try to 
help Bob to pick (it^) up. 
b. The socks are ready for you to put (them) 



on. 



(l*.6l) a. This rock is too heavy for John to try to 
lift the table, 
b. The socks are ready for you to put on the 
bathrobe . 
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as (U.6lc)'^d (U.6ld) respectively. 



(U.bl) c. John is bearing the rock on his shoiilders. 
d. The socks must hang from the chandelier 
before one is entitled to put on a 
• bathrobe . 



I do not see how the pertinent assumptions of {U.6la) and (U.6lb) 
cotild be coherently formulated unless they contain an occurrence of 
the rock or the socks respectively. Consequently, a coreferentiality 
condition must be satisfied at some level for sentences like (U.60a) 
and ik.GOh) or (U.6la) .andr.(U.6lb) to, make sense. In the absence 
of assumptions like (U.6ic) and (_U.6ld), a hearer assumes that the 
necessary connection between the meiin for-to clause in sentences 
like (U.60a) and (U.60b) will be made explicit in the actual strinfr, 
and. expects a coreferent of the subject of the main clause to show 
up in the for-to clause. Put differently, the resultative character 
of the for-to clause strongly invites the inference that it will 
contain a coreferent of the subject of the, main clause. This -invited 
inference is sufficient to cause the rxile which may delete the 
coreferent; inside the for-to clause to be subject to essentially^ 



A. Zwicky informs me that the deleted versions of the a- 
sentences in (U.62) -and -(U.63) are out for him, but that the c- 
sentences are marginal, while the b-ohes are OK. I don't know 
what to make of this fact, except assume that the obligatory^ 
'Coreferentiality condition may be less strong for some speakers 
when it is only invited; however, I do not know how to account for 
the gradations in acceptability within the above paradigms which 
Zwicky reports. 



the same constraints as niles which delete a coreferent under strictly 
linguistic conditions of obligatory coreferentiality, such as 
Japanese Kelativization. Ross (1967) illustrates the point with 
respect to the first part of the CSC, and the LBC and the f5SC, in 
the case of the deletion rules manifested in (U.6o); as none of these 
constraints are Vubcases of my Subservient Satellite Constraint, T 
provide pertinent illustrations in (U.62) and (U.63). A parallel 
paradigm is constructed for Japanese Relativization in (U.6U). 



(U.62) a. 



The rock is too heavy for me to believe that 
a boy willing to pick CJ"^ f up exists. 




This rock is too heavy for me to both lift 




This rock is too heavy for me to believe that you 
broke your back in lifting •'•^ V ' up. 
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{h £3) a. The bathrobe is ready for you to find a girl 
\ willing to put ^ 

b. Tlie bathrobe^ is ready for you to simultaneously 
take ^'^^ J* your socks on. 



c. The bathrobe is ready for you to get decent 
by putting ^ on . 



ik.Gh) a. »Suki na onno no ko ga sotta teeburu.wa Tookyoo 
Likes girl left table tokio 

ni am. 

-in is 

* 'The table which the girl .who likes left is in 
; Tokyo.! 

b. *John wa Mary ga suki de, Jim ga suki\na onna 

John Mary likes Jim likes 
no, kb ma kiree da. 
girl pretty 
*'The. girl who John likes Mary and Jim likes 
is pretty. • 

c. *John ga suki ta kara Tookyoo e itta ohna no ko 

John likes because Tokio to- went '"girl 
wa okotte ini. 
mad . is 

*'The girl who John went to Tokyo because he 
loves is mad. ' 

It can be seen that (U.62)-(U.6U) and (U.59), which Ross lumped 
together as unidirectional pronominelization phenomena, are essentially 
different; thus, unacceptability arises in the. latter case independently 
of deletion, while in the fomer, it arises only when deletion, hot 
merely pronominalization ^ has applied. Also, . Ross* only reason for 
regsirding the deletion phenomena in (U.6o)-(|I.6U) as imidirectional 
is that the complement clause of heavy or read y apparently cannot be 
preposed; but, by the same token, there is no reason for claimin/t 
that the rule which allows a non-null pro-form in the above examples 
is bidirectional, and if it is not, then non-null and null pronominali- 
zation appear to behave differently. Notice that Ross cannot claim 
that that rule of non-null pronomirialization is the same as pronominal I- 
zation in general, for he chose to regard VP-Deletion in comparativeri 
and equitatives as unidirectional and therefore different from Vj*- 
Deletion in general. 



Actually VP-Deletion in comparative or equitative clausei; 
differs from VP-Deletion in other environments in being obligatory, 
as the contrast between (i) and (ii) shows. 
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, _^ , ^ f 

(ii) -John is teller than Bill is ^^811^ 



I do not think, however, that this fact shbiild be construed 



as an indication that the rule which optipncilly deletes clever 
and the one which obligatorily deletes tall in (i) and (ii) 
respectively are distinct rules. 



Given the inadequacies of the DBF, I propose to reformulate it 
as the observationally more adequate (U.65): 

The DBF': The Subservient Satellite Constraint 
is applicable to chopping, deletion \mder 
obligatory coreferenticLLity, feature-chan/^inf;, 
and semantic neutralization; it is not 
applicable to non-null prbnominalization or 
to deletion without obiiRatory coreferentiality . 

11 * 

I mention deletion under obligatory coreferentiality because 
I do not know of any deep structure conditions mentioninf^ obligatory 
i dentity of sense . 



The explanation I propose for {h.6')i is 

(U.06) The situations in which the Hubservient Satellite 
Constraint applies aire perceived an involvinp: 
a structural, discontinuity; the reraainder are 
' not . 

The point of the claim made by (U.66) cetn be seen most clearly 
by comparing- chopping and copying phenomena, which the DBF* assigns 
to different subclasses, as shown in (U.6Ta) and (U.67b) respectively, 

(U.67) a. *This is the girl who you met a boy who likes , 
b. This is the girl wh£ you met a boy, who like s 
her Cdialectally restricted^. 

In both sentences , the N i B .and the pertinent externail element have 
been underscored. The essential difference is that the r'atolllte in 
ill-formed in (U.6Ta) unless integrated with the external element, 
while this is not so in (U.67b). In this connection f propose the 
following hyf)0thesi3; 

(J|.60) The relation which -^ob tains between the external 
element ismd the Gatellite in sentences like 
(l+.67a) and (U. 67b) are processed by 
essentially different strategies . 
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Thus, in the former case, the discontinuity is resolved by s tructural 
integrative strategies ; in the latter, it is handled by strategies 
vhich recognize semantic relations between elements . The recognition 
of identity of reference in (^4. 67b) is, I suggest, not essentiedly 
different from the recognition of a relation of antonyry or 
'opposition' between the underscored' constituents in {h.SOei) and 
(j*.^9b) respectively. 

(1*.69) a. Jphn is cl ever , but his cousin is stupid . 

b. John doesn't dislike cake, he loathes it. 

I hope that it is intuitively clear why I do not regard the sentences 
in {h.69) as containing structurally discontinuous elements, although' 
it as not easy to make this notion formally precise. Also, I do 
not know of any experiments vhich support or ca^t doubt on CU.68), 
and it is not easy to see how one would go about setting up pertinent 
experiments. _ I .hope, however, that my proposal is a plauf>ible one, 
and I will try to show in the remainder of ^this sjsction that-i^t- 
provides a natural and "internally consistent explanation for the 
DBP'.. . . - 

Notice that it would be possible, in principle, for sentences 
like (^. 67a). to be interpreted by reconstructing 8 null anaphor in 
the position of the object of like , ' which' would be later interpreted 
as coreferential with who by semantic strategies. Jf that were the 
case , we would expect there to be no distinction in acceptability 
l)etween (lL.67aO and (l*.67b), but the fact that there is such a 
distinctipn suggests that a null anaphor is not. reconstructed. The 
reason is^ I suggest, that the recognition of the existence of a 
necessary coreferent. of some element falerts ' the integrative 
strategies, so that integration will have to ta>:e place when the 
chance arises; on the other hand, nothing alerts the strategies which 
reconstruct missing elements , and what takes place is integration, 
rather than reconstruction followed by recognition of identity. 

The position I have adopted predicts that deletion under 
obligatory identity will always behave identically, for the purposes 
of the Subservient Satellite Constraint ahd other island types, with 
chopping rules. As far as I know, this prediction is borne out by 
the facts (see {kX2)^{k.Gk)). A corollary of this position in that 
whenever the deletion of a pro-form is exempt from island constraints, 
there is no obligatory condition that a candidate for deletee status 
exist; I will show below that this qorollary is borne out by the 
available facts. Finally, this position predicts that Neubauer's 
claim (which I argue against in Appendix Four) to the effect that 
chopping rules behave differently from copying lules followed by 
the deletion of the original cannot be correct in principle, for the 
effect of copying is to introduce a necessary coreferent of the 
original. 

Notice also that for a theory which regards constraints like the 
Subservient Satellite Constraint as 'purely grammatical' ones, there 
is no way to capture the similarity between chopping and deletion 
under obligatory identity. Indeed, if chopping proceeds in one 
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step, the Subservient Satellite Constraint will be a transformational 
constraint for chopping arid a global one for deletion; if chopping 
proceeds in two steps ^ the Subservient Satellite Constraint vill be 
a global rule in both cases, but it will have to mention the output 
of copying in one case^ and underlying representation in the other. 

Before leaving the subject of chopping and deletion under 
obligatory identity, I think it worthwhile to dispose of an apparent 
counterexample to my account. 

{h.^0) *This is the apple which I know a boy who ate . 

In (U.70), the H & S is hot incomplete, since we can understand it 
as a, boy who ate something; . Unlike (U.67), where I claimed that 
nothing alerts the reconstructive strategies , the latter are alerted 

"- in (U.70) by the lexical properties of eat (which allows the deletion 
of its object if that object is a generic ) . As reconstructive 
strategies are generally based on ^ local cues.*, i.e., found in the 
same syntactic perceptual unit as the missing element, and since 
perceptual units must undergo' sor;ie processing before they can be 

•effectively integrated, one would expect reconstructive strategies 
to apply before integrative ones in general. Given such an ordering 
of strategies, the unacceptability of (U.70) seems to- pose a problem. 
The problem is, however, a spurious one: the missing elements in 
(U.70) can only be reconstructed as a generic, and generics have no 
antecedents . Therefore, if a genaric i£ reconstructed, the sentence 
fails' becliuse the semantic strategies cannot establish a connection 
between it and w hich ; if a generic is not reconstructed, the sentence 
fails for the same reasons eis (U.67a)» 

In contrast to (1*.70), consider the following paradigm (taken 
from Hoss (iy69b)) which shows that the reconstruction of an ana phoric , 
rather than generic , pro-form does not lead to unacceptability. 

' (U.7I) a. *I know that he must be proud of it, but I 

don't know how. Cthe sentence is unacceptable 
only on the reading 'I don't know how proud 
of it he must be', not on the reading 'I 
don't know in which way he must be proud of 
it'D. 

b. Johnny stole someone's wallet, but I forget 
whose . 

The difference in acceptability between (U.71a) and (U.71b) is duo to 
the fact that there exists a rule which deletes the heads of genitives 
\mder identity of sense, as iii I like this book, hut I don't lik e 
Bill's (book) , while there is no rule which deletes the heads of 
degree modifiers, as can be seen in (14.72). 



(U.T^) Jill is T3retty, but I don't know how 
she is. 




Consequently, the inapplicability of a head-constructive strategy to 
makes it impossible to' recover it as (U.73a), while the 
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applicability of such a strategy to (l*.Tlb) makes possible its 
construction as i^.TSh). 

{U.^3) a. I know that he must be proud of it, but I 
don*t know how proud (of it he must be); 
b. Johnny stole someone wallet, but I forget 

f wallet - " 

f (he stole )• 



Notice that neither (l*.73a) nor (l*.T3b) violate the LBC, whrie (l*-71a) 
does, which explains its unacceptability.l^ The contrast in accentability 



G. Drachman has pointed out to me that (i) is ^marginally 
acceptable ih his speech. 

(i) (?)l know that he must be proiid of it, but I don*t 
know how much. 

This^ suggests to me -that (i) must be derived from something like (ii), 
because how much is not in general movable from a left branch, as 
shown ' in. (iii). 

(ii) I know that he must be proud of it, but I don't 

know Eg he must be proud of it to wh-some extent]. 

(iii) a. John has a lot of money. 

b. How much money does John have? 

c. *How much does John have money? 

An apparent counterexample is furnished by (iva), since its source 
seems to include (ivb), rather than the \mgrairanatical (ivc). 

(iv) a. You don't know how very much John is in love 
with Mary. 

b. John is very much in love with Mary. 

c. *John is in love with Mary very much. 

However, the ungrammaticality of i(iiic) suggest that (ivc) is, 
after all, part of the source of (ivc), and (ivc) must become (ivb) 
by an obligatory proposing rule; 

I should like to point out that, while the LBC does Indeed 
account for (l|.71a), there are certain problems with it which 
suggest that some reanalysis would be in order. 

Thus , if how may be dominated by the category NJ^, it is hard to 
extend this categorization to prepositions and articles; still, nuch 
a move is suggested by the impossibility of deriving (vb.) from (va) 
or (vib) from (via). 

(v) a. John is In love with Mary Cnot 6ut of love' with 
herD . 

b. *It's in that John is love with Mary. 



9. 



(vi) a. I saw the man, not a man* 

b. *What I. saw man was the, not a. 

Notice that the LBC may be insufficient even if prepositions cuid 
articles are categorization as HP's, for it is highly improbable 
that languages with postpositions or post-nominal eirticles allow, 
the movement of such constituents. 

This would seem to require a RBC, instead of .the LBC, in such 
Ipiiguages. Notice, however, the Rumanian has both pre- and post- 
nominal articles (the indefinite and definite ones respectively), but 
if we impose both the LBC and the RBC in Rumanian, we wrongly predict 
that sentences like (vii) or (viii) are ungrammatical. 

(vii) Cit e Petre de degtepti 
How is Peter of clever 
'How clever Peter is!' 

(viii) Diferenja e enorma tntre un cere §i un patrat. 

'The difference is enormous between a circle and 
a square. 

I should also point out that Ross* explanation of the freezing 
of the heads of Complex NP's by the LBC strikes me as incorrect, for 
heads do not move in Japanese either (see (l^.lOl*) and ( 1*. 105) ), 'where 
they appear on the right-branch of; higher NP's, or in Hindi, where 
they may occur on either the right- ^or the left-branch of higher NP's* 



between (U.Jlb) and (U.70)- is thus due to the fact that the empty slot 
in the former is interpreted as a null anaphor, equivalent to one^, 
whose antecedent is wallet ^ while the empty slot in the latter can 
only be interpreted as a, generic, and therefore non-anaphoric, or as 
an empty slot, in which case the integrative strategies are alerted. 

We have so far discussed chopping, deletion under obligatory 
coreferentialit y and no n-nu^l pronominalization (illustrated by 
copying). It' remains to discuss the inapplicability of the Subservient 
Satellite Constraint to deletion without obligatory identity, and its 
applicability to featxire-changing and deep stmcture neutralir.ation. 

With respect to deletion without obligatory identity, the 
inapplicability of the Subservient Satellite Constraint follows quite 
straightforwardly- from the preceding discussion, as the lack of an 
obligatory identity condition fails to alert the integrative ntrategies. 
In (U.7Ua)-(li.7Ud)^ I illustrate the non-applicability of the CNPC to 
the rules of Genitive-Head Deletion, Super-Equi-NP-Deletlon , V}>- 
Ueletion and Siyicing respectively. 

iU.lU) a. I know that you like lUll'n book, hut you muni 
face the fact that I am going to promoto 
Greg's. 

b. Max was worried that the fact that It was 

dangerous to shave himself with his rusty 
old blade bothered Shirley. 
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(^♦7^) c. Just try to sqvare a circle, if you believe 
the guy who t.ilA you that you can! 
d.^ Somebody put ice cream in Jim's bed and He 
didn't believe the guys who told him that 
they didn't know who. 13, 1^ 

^%xamples (U.7J*a)-(U.7Ud) are f rom Neubauer (19T0). 

■'•She rules illustrated in ih.^k) supply additional evidence 
that the two parts of the CSC are different phenomena, being subl- 
eases of the Frozen Nucleus Constraint and the Hubservient Satellite 
Constraint respectively. Indeed, while these rules are not subject 
to the latter constraint, they are subject to the former. I 
illustrate the point with Genitive-Head Deletion halow; parallel 
paradigms for the other rules can easily be constructed by the reader, 

(i) I know that. you like Bill's books, but I am going 
to promote Greg's J^^^oks^ ^^^^^ everything 

Bill has ever written. 

(ii) I know that yoy like Bill's books, but I prefer 

ibooks and plays ^ 



Jolin 



plays and books 



In none of the sentences of (U.7l*a)-(U.7Ud) is there a condition 
requir ing identity between the deletion-controller and the deletion 
site, as (U;7!?a)-'(U.75d) respectively show. 



(U.7I5) a. I know that you like Bill's book, but you 
must face the fact that I am going to 
promote Greg's play. 

b. Max was worried that the fact that it was 

dangerous for Bill to shave himself with 
his rusty old blade bothered Shirley. _ > 

c. Just try to square a circle, if you believe 

the guy who told you that you can use a 
pencil adequately. 

d. Somebody put ice cream in Jim's bed and he 

didn't believe the guys who told him that 
they didn't know who had tried to get even 
with him. 



Concerning feature-changing 
'yubservlent Satellite Constraint 
-.'»atellites contain Inp; forms like 



phenomena, the applicability of t))e 
is entirely expected. Indeed, 
wi^, ever, i^-pronouno , etc . , are 
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not well-forraed unless coroianded by the appropriate neutrallzer ; 
unacceptability arises vhe'iT'tht latter is outside the dominance of 
the N & S node, since the structural well-formedness ^of the Satellite 
can only be determined by recognizing. an unpermissible discontinuity. 

Finally > the semantic neutralization of the. 'comparee' by the 
•comparator' in comparative and equitative, constructions 'is subject 
to Subservient Satellite Constraint for the same reasons as feature- 
changing; the well-formedness of" the clause contcdning the comparee 
can only be established by inspecting the comparator. This is shown 
by the contrast* in acceptability between (U.76a) and (l*.76b), which 
.have the same adverbial clause, and where acceptability depends on 
the compat ability of the comparator and the comparee ((l*.76c) shows 
that (l*.76b) contains a possible comparator). 

ik.TS) a. Joanne is more intelligent than Jill is 
efficient . 

b. *Joanne loves her husband more than Jill is 

efficient . 

c. Joanne loves her husband more than Jill h ates 

her brother . 

In (It .76a) and (l* .76c), the comparee occurs inside a satellite 
(the comparative clause), but the N 4 S node does not intervene (in 
the sense discussed above) , since the comparator is precisely the 
Nucleus of this N & S. On the other hand, the sentences in (1*.77) 
are all bad, because an N i S node does intervene between the 
compairator and the comparee, and the latter is part of the corresponding 
Satellite. 

ih.n) a. *Mary is ^^""^^ intelligent ^^^^J I know 



^ ^ .a boy who is productive. 

b. Mary is -^^"J intelligent you heard 

(*the claim that) she is productive. 

c. *Mary is intelligent she is 

productive and an assistant professor. 

d. *Mary is ^"^"^^^ intelligent {jl^^ Bill is 

in love with hor because she is productive. 

Semantic neutralization as in (1*. 76)- (14.77) is subject to the 
Subservient Satellite Constraint in Japanese as well, as illustrated 
for the coordinate subcase in (1*.78). 

(1*.78) *John wa taijuu ga 500 ponto te, tsuyoi yori Bill 

John weight 500 pounds (and) strong than Bill 
wa motto tsuyoi • 
more strong 

**Bill is stronger than John weighs 500 pounds and 
O is strong. '15 

ERIC 
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^It is- interesting that while the newantic neutralization 
found in comparatives is subject to the Subservient [Satellite 
Constraint in Japanese, feature neu^redization is not, at least in 
the cases that I am acquainted with* Thus, Ross (1967) pointed out 
that Japanese Reflexivization is free from his constraints, and I 
illustrate the same point below with respect to t;ie Japanese counters- 
part of Wh-Placement (i»e., the ride which converts an indefinite 
NP to dare when that NP is the 'target* of some question verb). 

Watakusi wa dare no sukina onna no ko ga Tookyoo e 
I whom 1 ikes gi rl Tokyo to 

itta no ka to kiita. 
went asked* 
•l asked the girl who likeb whom went to Tokyo.* 

(ii) John wa Mary ga suki de, Bill'wa dare ga suki dcsu ka, 
John Mary like Bill -^^wh.^ like 
*Mohn likes Mary and Bill liken whoV* 

(iii) Watakusi wa John wa dare ga sukita kara Tokyoo e 

I John who likes because Tokyo to 

itta no ka to kiita. , 

went .asked, 
asked John went to Tokyo because he loves who.' 

It should be pointed out that most speakers of English have the 
feeling that, while violations of the Subservient Satellite Constraint 
by feature neutralization are unacceptable, they are not nearly as 
bad as the sentences, in (U.77); for example, (iv) is considerably 
more tolerable than (U. 77c). 

(iv) *I wonder who iikes Moscow and which other city? 




This suggests that violations of the Subservient Satellite 
Constraint by the comparator-comparee relation are, in some sense, 
much stronger than similar violations by the relation holdinf: between 
a neutralizer and a neutralized feature. In fact, it in not altogethe 
unexpected that it should be so, for the neutralization of a comparee 
by a compeorator is semantic, and therefore of greater impact than 
feature neutralisation, which is essentially formal (notice that 
feature neutralization, as! in (iv), while based on a semantic 
relatio n between the question verb and some wh-word, nevertheless 
involves no semantic neutralization of the latter by the former). 
However, it should be made clear that the sentences in (4.77) » though 
bad enough to deserve a double asterisk, are by no means se manticall y 
ill-formed. For example, (U.78d) means that Bill Ic in love with Mary 
because she is productive and that she is productive to a lesser 
extent than (or to the same extent as) the one to which Mary is 
intelligent; thus, (U.78d) makes perfect sense semantically, and its 
unacceptability must be attributed to some non-semantic principle, 
such as the Subservient Satellite Constraint. 
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To dunmarizet I have ar^iiued in this section that the partition 
determined by the DBP' on syntactic phenomena depends on whether a 
structxural discontinuity is or is not perceived • I proposed that 
the interpretation of Satellites which contain a pro-form whose 
antecedent is outside the. N 4 S requires the recognition of the 
antecedent-anaphor relation, but this does not prevent the Satellite 
from being perceived as whole; similarly, deletion sites can be 
interpreted as null anaphors^ provided that there is no indication 
that some external element necessarily belongs in that slot. With 
respect to the phenomena which are subject to the Subservient* 
Satellite Constraint, I argued that they do involve a structural 
discontinuity; for chopping and certain deletions, because an 
external element must be integrated with the Satellite, and for 
feature and semantic neutralization, because the well-formedness of 
the Satellite is partly determined from outside the N & S. 

1».5. In this section, we consider the problems raised by the 
Frozen Nucleus Constraint with respect to chopping (as I pointed out 
in Chapter Three, the facts are insufficiently clear for other types 
of phenomena to warrant an attempt at explanation at this stage; 
I suppose, however, that deletion urnler obligatory corefercntiality 
can be explained in essentially the same^way as chopping)* 

Two kinds of situations can risult from chopping the Nucleus 
and leaving a Satellite behind: 

(14.79) a. The clause out of which the Nucleus was 
lifted appears well-formed, or 
b* The clause out of which the Nucleus was 
lifted appears ill-formed* 

I will try to shov that, both (l**79a) and (l*.79b) are perceptually 
complex situations, involving erroneous closure and conflict 
respectively (as far as I can see, interrupted behavior does not 
seem to be crucially involved). 

(U.79) is illustrated in (U.80) and (li.Sl). Contrast this 
paradigm with the one exhibited in (U.82) and (J1.83). 

(J4.80) a. I believe (that) Mary^s claim that Bill left 
is false. 

b. «Wh08e claim do I believe (that) C3 that Bill 
left is falseD? 

• (l».8l) a. I think that John, Bill and Mary have left, 
b. •Who do I think that C3 John and Mary have 
leftD? 

(1|.82) a« The difference between X and Y is clear. 
^ b. Cg The difference is clearD between X and Y. 

(1«*83) a. You think that the roof of that house was 
bloim off by a bomb, 
b. Of which house do you think that the roof 
- vwas blown off by a borabD? 
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In both (l*,bO)-(l*,8l) and (li,82)--(l*.83) , the clause out of which 
an element has chopped seems well-formed, but the total resxat is 
bad in the former t where ,ajlucleus was moved, while it is acceptable 
in the later, where a Satellite was moved. I surr.e^^t that the 
unacceptabi'^Uy of the b-sentences in (l*.80)-(l*.6l) is due to a 
principle or erroneous closxare, rather similar to (2*8). Specifically, 
I claim that the unacceptability of (l*.80b) and (l*.8lb) is due to 
the fact that the bracketed constituents are erroneously apprehended 
as complete sentences, and that they cannot be coherently integrated 
with the remainder of the str^ing without underRoinft significant - 
r eanalysis . 

Thus, in (l*.eo), the string that Bill left is a possible subject 
of i s false , and is interpreted as such; the correct integration of 
whose claim with that Bill lef t would require that the analysis of 
the latter as a subject of the bracketed clause he discarded, and 
that it be reinterpreted as a modifier df another HP which is the 
real subject, of that clause (in fact, the Nucleus of the subject NP, 
but Nuclei are by definition representative of their N & S*s). 

In contrast, even though the bracketed clauses of (l4,82b) and 
('^♦.83b) are possible clauses, their integration with the displaced 
constituents does not require that they be sir>ni^icantly reanalyzed . 
Kor example, the roof in (^4. 83b) is essentially still the subject of 
the bracketed clause, even after the optional appendage of which 
house has been attached to it. Thus, the difference in acceptability 
between Nucleus-chopping and Satellite-choppin;^, in (l4.80)-(l*.83) Is 
due to the fact that the former, but not the latter, requires a 
significant reanalysis of the form in which some substring has bcea 
initially construed. 

It should be pointed out that all the cases of errone<)un 
closure so far considered involved the sifinificant reanalysis (in 
the formidly undefined, but intxiitiyeiy clear sense of the ^bove 
accouTit) of the pertinent string; in fact, I suspect that complexity 
arises only when erroneous closure results in a percept which must 
be significantly reanalyzed at some later stage of processing. It 
would therefore be worthwhile to attempt to provide a rigorous 
definition of the notion 'significant reanalysis*, without which the 
prediction I have Just mad'.t cannot be formally stated* 

We now consider case (b) of (14.79), vhich is illustrated in 
lk.QU)AkM). 

(1*.81*) a« You think those members of the committee nre 
dead. 

* b. *Which members do you think Cg of the committee 

are dead? 



(l4.8i)) a. I told you «Tohn and Mary have left. 

b. *Vnio did I tell you C.. and Mary have Ifftl? 

•J 

(J*. 86) a. That John was fired because he was sick is 
appalling. 

b. *What because he was sick is appalling] 
is that John was fired. 
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(U.8>)^(1*.86) are different from (l*.80).(l*.8l) in that the bracketed 
clauses do not appear to be well-formed; we may expect this to 
function as a clear indication of a slot, thus making integration 
easy. However, (1*.81*)-(1*.86) involve a difficulty of a different 
sort; unlike (lt.82)-(l*.83) , the remainder of the N & S's (i.e., of 
the committee , a nd Mary , and b ecause he was sick respectively), are 
the wrong kind (in the sense of Appendix Two) of category for their 
being arguments of i^y sort of the predicates of the bracketed 
clauses , and the initial analysis of those clauses is rendered 
difficult by the perceptual conflict between the categorial kind of 
some constituent and the role it is called upon to' play. 

Like X'*.79a),, (!♦. 79b)' does not require, new perceptual principles 
for its .explanation. The unacceptability' of (U.8Ub) (l*,85b) and 
(l*.86b) is predicted by (l*.i*8): see section k.3)\ as there are no 
additional categorial or functional cues that could be brought to 
bear with respect to the pertinent bracketed strings, we are faced 
'With a situation of maximal conflict between the role that those 
strings must play and ^the categories to which they belong.. 

The fact that NuoLeus -chopping results, in a perceptually complex 
situation does not, of course, mean that no. counterexamples can exist 
in principle. Consider thus (h.Qjh)^^ which exhibits a violation of 



I am grateful to D.. T. Langendoen for calling this case to my 
attention. 



the Frozen Nucleus Constraint. 

(1+.87) 'a. I saw someone interesting at the party. 

b. Who did I see interesting at the party? 

Notice that we cannot claim that someone interesting: was- moved 
integrally by Question-Movement, the reduced relative clause somehow 
extraposing later, for the latter need: not appear at the end of a 
clause, as shown in (l*.88b). The badness of (l*.89b) is an additional 
argument against extraposition of a reduced relative, for a full 
relative can appear in that position, as shown by the acceptability 
of (l*.89a). 

(1+.88) a. You showed someone irteresting to my girl 
friend from Rome, 
b. Who did you show interesting ta my girl 
friend from Rome? / 

(1+.89) a. Who did you meet yesterday who vas interesting? 
b. *Who did you meet yesterday interesting? 

To bep,in with, notice that the class of adjectives which can 
be substituted for interesting ('*.87b) or (l*.88b) is fairly restricted; 
t!mo, it includes de\astating and frightenlnK , but not pr etty , 
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repugnant , surprising , appalling , or astoninhing (the adjectives 
which« behave lilce ^int er est i ng do not seem to form a natural class, 
but I have not looked at the problem in detail; moreover, there seems 
to be serious idiolectal variation in this respect). 

Secondl/, it can be shown that the structures found in (U.87b) 
or (U.88b) are complex, since acceptability decreases rapidly when 
the chopped elements" belong in more deeply embedded clauses • Thus, 
compare the marginal status of (U.90a) and the unacceptability of 
(U.8la) with the acceptability of (U^90b) and (U^91b), although the 
interdis continuous material is identical in each member of each set. 

(U.90) a* ??Who did I tell you Mary met interesting? 
b. Who did I tell you-Mary met who was 
interesting? 

(U.9r) a. «Who did I tell you Bill claims Mary met 
interesting? 
b* Who did I tell you- Bill claims Msrymet who 
was interesting? 

However, why can the Nucleus in constructions like someone 
interesting be chopped at all? The answer is, I believe, that it 
must , rather than can , /be chopped, in order to avoid cuiother 
unacceptable construction. Indeed, there exists a constraint (what- 
ever its explanation) which prohibits the sequence who interesting , 
as shown in (U.92a)* Thus, sentences like (Ut87) seem to result from 
a trade-off between constraints, such that the conflict is resolved 
by the (partial) suppression of one of them • ^7 Certain adjectival 



^Tof course, there is i.j reason in principle why both constraints 
could not have held (the meaning of (U.87b) can be conveyed by using 
a full, rather than reduced, relative clause), and I am prepared to 
believe that there are dialects in which both Constraints hold, with 
the resxat that both (U*87b) and (l*.92a) are out. 



modifiers are better than others after an interrogative pronoun, and 
suggestion of a trade-off between constraints is supported by the 
paradigm in (U.92)-(U.9U), where Nucleus -chopping seems to increase 
in acceptability as N & S-chopping decreases in acceptability. 

(U.92) a. *Who interesting did you show (to) the audience? 
b. Who did you show interesting^ to the audience? 

(U.93) a. ?Who { ^ a red scarf, did you show (to) 

I wearinfyj 

! 

the audience? 

b.??Who did you show 4 ^^^^ \ a red ncarf to the 
audience? (^wearing J 



* 
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{k.9k) a. Who who was interesting did you show (.to) 
the audience? 
b. ^\Jho did you show who was interesting to the 
audience? 

It is important to distinguish genuine violations of the Frozen 
Nucleus Constraint from spurious ones. Thus, there are a number of 
rules which move Satellites to the end of the containing clauses, and 
when the N & S originates in clause-final, position, its chopping 
followed by riglitwajrd migration of the. Satellite looks as if the 
Nucleus alone had been chopped. Such' situations will not cause 
perceptual problems, because the Nucleus will be synthesized with an 
optional slot for a Satellite (as discussed above), and the 
Satellite will not be in a position which conflicts with its 
categorial kind (since clause final position is ambiguous between 
some-argxunent of the .predicate and 'extraposed' constituent, provided 
that there is a rule of* rightward migration relating the Nucleus and 
the Satellite). One such cass of spurious Nucleus-chopping was 
recognized in Ross (1967), where it was pointed out that sentences 
like (1*. 95c) arise from (1*. 95a) through the intermediate stage (U.95b), 
rather than directly from (l*.95a|. 

(1*.95) I saw a man who was wearing a tie. 

b. Who who was wearing a tie did I see? 
^ Who did I see who was weeoring a tie? 

Other instances of N 4 S chopping followed by Hatellite migration 
have been misconstrued. For example, in Stockwell et al., (1968), 
it is claimed that the heads of postnominal genitival constructions 
like the underscored constituent in (l».96a) can be clefted, yielding 
(l*.96b). I believe that defting yields the intermediate stafce (U.96c), 
from which (1*. 96b) i:i derived by the independently motivated rule of 
Kxtraposition-of-PP. My reasons for thinking bo are that (Ik 97b) is 
not derivable from (1*. 97a)', although we would expect (U.97b) to be 
grammatical if Nucleus-clefting were possible. 

(1*.96) a. I found a necktie of tJohn's. 

b. It's a necktie I found of John's. 

c. It's a necktie of John's that I found. 

(1*.97) a. I showed a necktie of John's to Mary. 

b. *It's a necktie I showed of John's to Mary. 

c. ?*It's a necktie I showed to Mary of John'.s. 

(U.97C) is, unfortunately, bad for independent reasons (extraposition 
movements cannot cross a prepositional phrase in general). The badness 
of (l*.97b) is, however, a sufficient reason for doubting that {h.9Gh) 
arises by Nucleus-chopping. 

1). 1'erlmutter (oral communication) has claimed' that Nl'*a In 
construction with yes 'all' c€m be chopped In lUovenian. Ilin 
argument wiis that C^9&) must be a case of genuine head-chopping, for 
ves exhlbltf; the ordinary accusative ending expected If concord taknn 
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place between it and riz prior to chopping; on the other hand, if 
the head of yes had been copied and deleted through an intermediate 
stage of pronominolization to ga (it), the quantifier would have 
agreed with ga,- exhibiting the animate accusative ending, as in 
(k.99). 

(U.98) Riz sem zelel projesti ves.. 

The rice I wanted to eat all 

(U.99) Zelel sem ga pojesti vsega. 
I wanted it to eat all 

Notice, however, that the above facts do not support Perlmutter' 
claim in case there is a rule of Quantifier -Floating in. Slovenian, 
similar to the one exhibited in ( U. 100 )' for English . 

(U.lOO) a. All the boys have left • 
b; The boys.. all have left. 

c. The boys have all left. 

d. ?*The boys have left ail Cthis sentence has 

become ungrammatical quite recenlty; it was 
perfectly acceptable in 19th century English] . 

As I am not acquainted with Slovenian,. I ccmnot pfovrde an analysis 
of (U;98), but I will show that similar facts obtain in Rximaniaii, 
and that they can be explained there by an independently motivated 
rule of Quantifier-Floating similar to the one foiind in English. 
Thus, consider (U.lOl) and (U.102). - 

(U.lOl) a. Am ySzut Parisul tot. 

I saw Paris all 

•l have seen all of Paris'. 

• b. Parisul, 1-am yazut tot. 
Paris, it I saw all 
•Paris, I have seen all of it.' 



(U.I02) a. Am vSzut Rominia toatS 
I saw Rumania all 
•l have seen all of Rumania.* 

b. Romtnia, am vSzut-o toata. 
Rumania I saw it all 
•Rumania, I have seen all of it.' 

As we can see, the quantifier tot 'all* exhibits the same kind of 
agreement before and after topicalization. However, this quantifier 
can also float quite freely, as shown by (U.103). Consequently, 
sentences like (U.ioib) and (U*102b) can be derived from their a- 
counterparts in two stages, by topicalizing the entire N & S and then 
a].lowing the quantifier to float; 
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(1*.103) a* Toti baietii au plecat. 

'A31 the boys, have left ' * 

b. Baie$ii toti au plecat. 
'The boys all have left' 

c. Baietii au plecat toti^ 
•The boys have left all' 

The conclusion which may be drawn from the above discussion is 
that once the real and/or apparent - counterexamples have been accounted 
for in a principled way, the Frozen Nucleus Constraint' emerges as a 
genuine principle of language, which can receive a 'straightforward 
explanation in terms of general and independently motiyated principles 
of performance. 

In contrast, consider Ross' account of the freezing of Nuclei. 
He only discusses this problem in, relation to complex NP's, and notes, 
in section 2.3, that the chopping of the head of a relative clause 
or of a complement clause 'will be shown to be excluded by either of 
two independent, conditions: the Complex NP Constraint . or the Pied- 
Piping Convention!.. As the CNPC is fonmilated in his l|.l, it cannot 
freeze, the heads of complex NP's, for it only refers to elements of 
the^modifying clause; therefore, it appears Ross decided to leave it 
to the Pied-Piping Convention (and, specifically, to the liCft Branch 
Condition on Jthe latter) to block the migration of the heads of Complex 
NP's. 

I believe that it is wrong to account for this phenomenon with 
the LBC (see .also note 12 )i because of the implied claim that the heads 
of complex NP's cannot move only when they precede the modifVinp clause . 
This conclusion strikes me as unnatural in the extreme; moreover, it 
is probably false. Thus, the head noun of complex NP's always follows 
the modifying clause In Japanese, but they cannot be chopped either. ' 
(li.lOlib) and (l|.105b) show that Japanese Topicalization • which moves 
constituents to sentence initial position, cannot move the head of a 
relative clause or a complement clause respectively* 

(i|.10i|) a^ Kodomo go tabete iru sedcana wa ookii. 

child eating is fish big 
'The fish which the child is eating is big.' 
b. *Sakana wa kodomo ga tabete iru ookii. 
fish . child eating is big 

(U.105) a. Otto ga kono boosi o kabutte it a to iu 
Otto this hat wearing was .that 
syutyoo o watakusi wa sinzita. 
claim I believed 

'I believed the claim that Otto was wearing thin 
hat.' 

b. *Syutyoo wa. Otto ga kono boosl o kabutte ita 
the claim. Otto this hat wearing was 
to iu o watakusi wa sinzita. 
that I believed. 

[ *'The claim, I believed that Otto was wearing 

this hat.' 
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I believe we can conclude that the freezing of the heads of 
complex NP's is much more adequately accounted for by the Frozen 
Nucleus Constraint than by the LBC, since the former, but not the 
•latter can account (a) for the freezing bf complex NP heads which 
fOlloy the modifying clause, and (b) for the freezing of Kuclei 
in constructions other than complex HP's . .^r-i^ 



I 



t 



CHAPTl'JR V 
SIJMI'IARY AND FURTHER OUTLOOKf. 



The purpose of this dissertation has been twofold: 

(5.1) a. To provide an improved syntactic ancdysis of 
the phenomena known as •island constraints*, 
and 

b. to offer a natural explanation of those and 
other phenomena in behavioral terms. 

In regard to (5.1a), I believe the major contribution of this 
dissertation to be the notion Nucleus and 'Satellit e > formally definable 
in terms of the notions dominate and categorial kind ; there is over- 
whelming evidence that the latter two are independently needed in an 
adequate description of natural languages , and a l€trge part of my 
effort has been directed towards showing that the notion of N & r> 
must also be part of a descriptively adequate grammatical theory. 
The notion N & S has made it possible to recognize that the notion 
of . island constraint was a cover term for two largely independent 
and^ weaker constraints , the Frozen Nucleus Constraint and the Sub- 
servient Satellite Constraint; moreover, the excessive power of the 
various island constraints forces the recognition, of a large number 
of counterexamples , an embarrassment which my proposal , on the whole ♦ 
avoids . 

With respect to its second purpose, this dissertation has 
attempted to show that the phenomena discussed in Ross (19^7), l^ostal 
(1971), and elsewhere, can be accounted for in terms of behavioral 
principles involving erroneous jclosure, interrupted behavior, and 
perceptual conflict. Closure principles underlie- my account of some 
instances of the Frozen Nucleus Constraints, as well as of some 
instances of the Right Roof Constraint; interruption principles 
underlie my accounts of some instances of the Cross-Over phenomena 
involving unbounded rules, 'of the Right Roof Constraint, of the 
optionality of Pied-Piping, of the possibility of moving elements out 
of arbitrarily deeply embedded clauses , and of the problems created 
by the migration of elements out of pre-verbal clauses in SVO 
languages; conflict principles underlie my accounts of the Cross- 
Over phenomena involving bounded rules , of some instances of the Frozen 
Nucleus Constraint, of the Subservient Satellite Constraint', and of 
the Langacker-Hoss constraint on backwards pronominalization. 

It has become a commonplace nowadays to end a paper by pointing 
out that it has raised many more questions than it has answered; 
Kiven our extremely inadequate knowled^re of the mental reality under- 
lying syntactic phenomena, such an observation is almost a trulnm. 
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As far as this dissertation is concerned, there are countless 
pertinent, issues it has not even attempted to tackle, such as: 

(5.2) a. What is a possible behavioral principle? 

b. How do behavioral constraints interact with 

semantic ones in determining acceptability? 

c. Are there linguistic phemomena which can be 

explained in terms of speech production, but 
not of speech perception? 

d. Is it possible to predict grammatization, and 

i'f so, what are the nature, the weights, 
and the principles of interaction, of the 
pertinent variables? 

e. ' Are there any 'purely syntactic' universal 

constraints? 

Questions like the above are of great theoretical interest, but I 
do not think the appropriate answers will be forthcoming in the 
near future. 

I believe that problems of more immediate interest concern the 
work which remains -to be done in connection with the issues which 
this dissertation has attempted to tackle. First, as my research 
has centered largely on English (and possibly not even on all the 
pertinent areas of English), my proposals are seriously under- 
determined by the data, and additional research on a reasonable 
sample of the languages and dialects of the world is an absolute must, 
oecond It is certainly desirable to submit as much as possible of my 
analysis to experimental testing, for, even if my proposals are 
plausible ones, they are certainly not the only logically possible 
ones. If this dissertation succeeds in stimulating such work in the 
future, I will consider that it has accompllah-^d Its purpose, even 
If every single claim it has made should eventually turn out to be 



APPENDIX ^ONE 



I do not wish to claim that (2.56) is the deepest representation 
of (2.5l*a), but only a remote representation of it, because (2.56) 
is not semantically interpjretable . Various ways of dealing with the 
semantic interpretation of (2.56) suggest themselves, the most 
satisfactory of which seems to be (a) below. 

(a) At the deepest level, the boxed NP is empty, exactly like 
the Predicate, and the semantic interpretation of this structure is 
carried out by the same rule which interprets the underlying repre- 
sentation of the synonymous (2.5**) (the empty nodes as well as the 
copula would be disregarded). This rule of interpretation is the 
deep structure counterpart of the surface interpretive rule proposed 
in Chomsky (I969): \ftiLle in the latter accoTint, heavy stress is assigned 
optionally and the presupposition is obligatorily inferred in the 
presence of stress, in my accoxxnt, presupposition may or may not exist, 
and heavy stress is obligatorily assigned in the presence of pre- 
supposition. 

The presupposition of a sentence must be a proper subpart of 
the meaning of that sentence. Thus, (Al.la) may presuppose not only 
(Al.le), but also (Al.lb, c, or d); 

(AI.1) a. John saw Mary. 

b. John saw a girl. 

c. John saw a person. 

d. John saw someone. 

e. John saw something. 

Corresponding to (Al.lb)-(Al.le) , the pseudo-cleft output will be 
(A1.2b)-(A1.2e) respectively. 

(A1.2) b. The girl who John saw was Mary. 

c. The person who John saw was Mary. 

d. The one who John saw was Mary. 

e. What John saw was Mary. 

similarly (A1.3a) may presuppose (A1.3b), in addition to (A1.3c). 

(AI.3) a. I noticed John's grisly sense of humor. 

b. I noticed something about John. 

c. I noticed something. 

Corresponding to (A1.3b) and (A1.3c), we have the pseudo-clefts 
CAl.l4b) and (Al.Uc). 

(AI.I4) b. What I noticed about John was his grisly sense 
of hmnor. 

c. What I noticed was John's grisly sense of humor. - 
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Sentences like (Al.l*b) have been claimed by Green (1971) to have 
no non-pseudo-cleft counterpart and thus to create serious problems 
for any analysis of pseudo-cleft constructions which includes a non- 
pseudo-cleft counterpart. However, given universal principles of 
semantic implication, the non-pseudo-cleft counterpart of (Al*l*b) is 
(Al.i+a), not the ungrammatical *I noticed John's grisly sense of 
humor about him which Green regards as the source of the difficulty. 

In slightly more complicated situations, the presupposition is 
inferrable from the non-pseudo-cleft counterpart in conjunction with 
a number of pragmatic assumptions (for some discussion, see Lakoff 
(1971)). • 

Consider (A1.5a) and the pragmatic assumptions (Al,5b)-(A1.5e) • 

(A1.5) a.. A man like Hitler was t*-"* loved to seize power, 

b. Hitler was a vicious tyrcmt. 

c. The world has a duty to prevent vicious 

tyrants from seizing power. 

d. The world did not prevent Hitler from seizing 

power . 

e. When one does not do one's duty, something is 

wrong with the one who thus fails. 

From (A1.5b)-(A1.5e), we con infer (A1.6). 

(A1.5) nonething is wrong with the world. 

Taking (A1.5a) as the non-psoudo-cleft counterpart and (A1.6) as the 
presupposition, we can construct the pseudo-cleft in (A1.7), 

(A1.7) What's wrong with the world is that a man like 
Hitler wets allowed to seize power. 

Heturnin;: now to the simpler (2.:Jl*a), the remote representation 
is derived by copying something (i.e., the pro-form corrennondinp; 
to the focus) into the boxed IIP, and definitizing it later because of 
the 'preyio.us mention' (that definite pronouns and determiners can 
have their antecedents in presuppositions is shown by sentences like 
the boy saw f>lary, which is meaningless, unless a previous mention of 
the referent, of the boy is presupposed). 

If tHe~rule which inserts a copy of the pro-form into the boxed 
NP seems insufficiently motivated, consider the alternatives: 

(b) We may regard (2.56) as a pre-semantic structure, in which 
case the semantic interpretation would apply to the output of Focus- 
Placement (see below), that is, (A1.8). 
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(A1.8) 




The thing John ate something^ vas an apple 



Presupposition coxild also be determined at this stage (on the basis 
of the general principle that restrictive relatives with 'definite deter- 
miners are presupposed), and so could coreferentlajity. How'^ver, apart 
•from the fact that singling out the output of Focus-Placement for 
semantic interpretation would be no less arbitrary than the copying, rule 
I have proposed, this solution could not work because Focus-Placement 
has to follow the rule which determines the distribution of anjr (see 
the discussion in connection with examples (2.1)8) and (2.'jy)^ which 
in turn has to follow Passive, as (A1.9) shows: 

(A1.9) a. ^Anybody didn't kill John. 

b. John wasn't killed by anybody. 

c. John didn't kill anybody. 

d. ^Anybody wasn't killed by John. 

But if Focus -Placement has to follow Passive, it would be wrong to 
single out its output for semcLntic interpretation, for, in case the 
l^assive has applied, pMsive and active structures would have to be 
interpreted by separate rules, thus missinf^ an obvious generalization. 

(c) Another possibility would be to regard (i) as essentially 
the deepest representation of (2.1), as proposed in Postal (1971). ''uch 
an analysis would create nc» problem for semantic interpretation, but 
would entail the loss of all the syntactic generalizations al>out pseudo- 
cleft constructions which will be discussed below, namely the correct 
statement of selectional restrictions, the distribution of neutralized 
elements, the impossibility of removing elements from pseudo-cleft 
phrases, and the fact that the acceptability of pseudo-cleft 
constructions is determined by the principles which limit rirJitward 
movements i.e., (2.35c). Moreover, to derive sentences like {2.')Uh)^ 
Postal is forced to regard something like (2.5l4a) cis a more remote 
representation and to posit- a rule of Contrast-Movement. That rule 
would have to apply obligatorily to (Al.lOa) to derive (Al.lOb). 
The 111-formedness of (Al.lOa) appears as an accidental fact in this 
analysis. 

(Al.lO) a. *What I saw Bill the table wets under, 
b. I saw Bill under the table. 

On the other hand, if (Al.lOb) in regarded as more banic than 
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(Al.lOa), the latter is excluded by the general principle that trans- 
romations do not nove 'function words' (or pai'ts of words as 
pointed out in Chomsky (1969)). 1 * 

A still more serioiis problem is raised by (Al.lla) which one 
could say in reply to (Al.llb). but for which, given Postal's 
analysis, it is hard to imagine a source. 

(A1.11) a. No, John saw the house. — ' 
b. Jill saw the table. 

Given the untenability of (b) or (c) above on syntactic Rrounds. 
I have assumed an analysis along the lines of (a) in 2.1.3.1, because 
the awkwardness of having empty nodes in underlying structure is not 
an intolerable difficulty, and because (a) makes it possible to 
handle the syntactic problems involved in- pseudo-clefting, which is 
a primary concern of that section. 



1 
I 
I 

I 

I 

I 



APPENDIX TWO 

Optionality is in fact not important for its own sake, but 
rather for showing that tne N & S-node is necessarily of the same 
•categorial kind* as in the Nucleus-node. The notion of 'kind* that 
I have in mind is ill-defined in linguistic theory at present, since 
the onJy notation available for labeling categories does not 
differentiate between gerundials, that-clauses and f or-to- clauses ^ 
for example, all being assigned the label fJ. Nevertheless, there is 
a great deal of syntactic evidence that finer categorial distinctions 
are needed, since transformations are sometimes sensitive to degrees 
of sentencehood, nominality, or adjectivalness (for details, see 
Ross (1972)). The failure to recognize such degrees has been 
responsible, I believe, for a great deal of xinnecessary argumentation 
in the literature* For example, Emmonds (1969) argued that geruhdials 
are NP's, not S's, because they do not extrapose; on the other hand, 
gerundials behave like sentences with respect to other syntactic 
phenomena, such as pronominalization constraints. Thus, unless we 
regard gerundials as clauses it is hard to account for the ungrammati- 
cality of (A2.1). 

(A2.1) ^He^ disapproves of your flattering Johnj. 

The evidence, however, no longer appears contradictory if we posit 
a continuous scale ranging from nominality to sentencehood, such as 
in (A2.2). 

(A2.2) 



a • that-clauses 
b • For-to- clause s 

c. gerundial clauses 

d. gerundial nominalizations 

e. deverbal nouns 
WaMIHALITY 

A categorial scale like (A2.2) imposes an implicational constraint, 
in the sense that every rule or constraint which mentions the category 

or HP must partition the scale into two continuous sets, one of 
which may be null. That is, the correctness of (A2.2) would be 
challenged by a constraint which would bloc:k (A2.2b) and (A2.2d), but 
not (A2.2a, c, and e). I do not know of such counterexamples, and I 
give a few examples, below which support (A2.2). (A2.3)-(A2.5) show 
(A2.2) is confirmed by the Sentential f5ubject Constraint, the NP-over- 
r> constraint, and Chomsky's Specified Hubject Condition (nee Appendix 
Three ) respectively . 
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(A2.3) a. *It*s John who that you killed is clear. 

*It*s John who for you to meet may be dangerous, 
c» *It*s John who meeting may prove dangerous* 

d. . It's John who the shooting of has become 

imperative. 

e. It's John that a pictxire of is in my pocket. 

i'A2.h) a. ^Is that John left obvious? 

b. *Is for John vo leave desirable? 

e. Is John's leaving the country desirable? 

d. Is John's shooting of so many rabbits legal? 

e. Is John's picture of Maiy available to you? 

(A2.5) a. It's -John who I wag; told that you met at, the 
party. 

. b. It's John who I intend for you to meet. 

c. *It's John who I am planning on Mary's yneeting. 

d. *It's John Mho 1 am planning on a detective's 

shooting of. 

e. *It's John that I saw Bill's picture of. 

Two more phenomena which support (A2.2) we Extraposition and 
Extraposition-from-NP, as shown in (A2.6) rmd (A2.7) respectively. 

^ (A2.6) a. It's desirable iihat John should leave. 

b. It was expected for John to leave. 

c. *It's regrettable John's leaving. 

d. *It's regrettable John's shooting of the 
* rabbits. 

e. *It was unexpected John's proof of the theorem. 

(A2.7) a. The claim is preposterous that John is a 
traitor. 

b. *The hope is unrealistic for John to get well. 

c. *The prospects are dim of John's getting well. 

d. *The possibility is frightening of the shooting 

of so many rabbits. 

e. *The idea never occurred to me of the proof of 

the theorem. 



Concerning (A2.7e)» it should be pointed out that there are 
sentences like (A2.8)^ which apparently violate the hierarchy in 
(A2.2). 

(A2.8) The choice is hard between A and B. 



Thus, uhlike (A2.7e), (A2.8) is acceptable, even t;aough the constituent 
extraposed from NP is a PP (which is presumably at the nominal' end of 
the scale). However, it seems reasonable to assume that (A2.8) is 
not derived by Extraposition* f rem* NP, but by another rule (usually 
called ExtrapositionrOf-PP). Although I have not investigated the 
domain of applicability of the latter rule, it seems clear that it 
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is more restricted than that of the former, and I suspect that 
detailed investigation would turn out additional formal differences 
. between the two rules. 

Another reason for believing that the two rules should not be 
collapsed is that there are acceptable cases of 'Extraposition-of- 
NP ; that is, exactly as there are grammatical sentences apparently 

derived like (A2.7e), there also are grammatical sentencerfiki 

{A2.9) which are apparently derived like (A2.6)'. 

U2.9) It's awful the way he behaves in front of women. 

That (A2.9) is not derived by ordinary Extraposition is argued in 
detail in Elliot (1971, Ch. 2), where it is shown-that there is a 
separate rule of Extraposition-of-NP restrict^ to" "exclamatory' 
predicates. j . 

If Extraposition and Extraposition-from-NP are not involved 
in the derivation of (A2.9) and (A2.8) respectively, the claim that 
these two rules recognize (A2.2) is consistent with the facts. 

The notion of categorial kind as illustrated by (A2.2) is clearly 
not the only possible or existing one. Although I do not have anything 
like a tTieory of categorial kinds at present, I feel it is rather 
obvious that a prepositional phrase and the noun phrase it necessarily 
contains are not NP's of the same kind. For example, the doubly- 
underscored NP in (A2.10a) cannot be substituted for the singly- 
underscored one, as (A2.10b) suggests. 

(A2.10) a. I did it for John , 
b. *I did it Johni 

ijimilarly, although coordinations are represented as in (3.99), so 
that the boxed X and X2 belong to the same category, it is quite 
clear that they are not of the same kind, for the former cannot 
appear in most contexts in which the latter can, as illustrated in 
(A2.11) . 



(A2.11) a. Bill left. 

b. «0r Bill left. 



The ungramraaticality of (Al.lOb) and (A2.11b) is a consequence 
of the obligatory character of prepositions in PP's and of or in 
second disjuncts, as opposed to the optional character of non- 
restrictive relative clauses in NP's. Therefore, even if the notion 
of categorial kind could be made explicit in linguistic theory, 
replacing (3.I) by another definition (3.1') which would differ from 
(3.1) in lacking provision (iii) and in having a strengthened 
provision (i) such that X and Y would be required to belong to the 
same kind of category would be no more than proposing a notational 
variant, so long as optionality is the only reaaon for claiming 
that X and Y are of the same kind. 

Notice also that (3.1) is trying to capture the intuition that 
It is g significant fact about N & U's that their Nucleua is of the 
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same kind as the entire construction. Thus (A2.12) contains two 
HP's of the same kind, which are moreove> substitutable in the 
context given; however, the fact that the two NP*s are of the same 
kind is not a necessary or significant one, and (3.1) correctly 
prevents the internal NP from being regarded as the Nucleus of the 
larger one. 

(A2.12) A man who was talking to a woman had a heart 
attack. 



APPENDIX THREE 



The fact that some cases of Extraposition-from-NP are awkward 
does not indicate that (3«2iii) has questionable status; rather, 
the awkward cases can be explained by independent constraints. We 
have .already seen two such constraints (in Chapter IVo): (a) the 
acceptability of extraposed clauses decreases in proportion to the 
complexity of the inaterieLL which intervenes between- them and their 
original head,, and (b) clLI other things being equal, a readinf; of a 
sentence that involves Extrap6sition-from-NP tends to be discarded 
in favor of cmother reading which involves n'o* extraposition or 
extraposition over less complex material. These constraints were 
shown in Chapter Two to be quite general and in no way restricted to 
the rule in question. I wish to show now that (c) Extraposition- 
rrpm-NP yields increasingly unacceptabl.e results in proportion to 
the decree of specificity of the head NP. Thus, consider (A3.1). 

(A3.1) a. A man Just .Left who was wearing a hat. 

b. ?The man just left who was wearing "a hat. 

c. ??That man just left who was wearing a hat. 

d. ^John's brother Just left who was wearing a 

Kat. 

e. **John's brother's son's daughter Just left 

who was vearing a hat. 

The constraint which is operative in (A3.1) is clearly more general 
than meets the eye; indeed, as was originally pointed out by 
Warshawsky (I966) and recently elaborated on in Chomsky (lyYl), the 
complements of NP's are increasingly impervious to operations 
involving the complement and f?.n element external to the N & S in 
proportion to the specificity of the determiner of the Nucleus. 
Tims, the reflexivization or chopping of a nominal complement 
exhibit the same gradual deterioration as that seen in (A3.l)> fits 
yhown by (A3.i^) and (A3. 3) respectively (additional phenomena which 
behave (;imilarly are discussed in Chomsky (l97l)). ' 

(A3.i^). a. J gave John a picture of himself. 

b. ?I gave John that picture of himnelf. 

c. *I gave John Mary's picture of himnolf. 

d. **l gave John Mary's sister's daughter's 

picture of himself. 

(A3. 3) a. What did I give you a. picture of? 

b. ?What did I give you thia.t picture of? 

c. *What did I give you Bill's picture of? 

d. ^^What did I give you Bill's father's brother's 

picture of? , 
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Chomsky proposes to handle these cases by his Specified Subject 
Condition ^ which T reproduce below: 

(A3.^) No rule can involve L> "^^e structure 

where Z is the specified subject of WYV in a. 

This condition in effect states, that a discontinuous relation cannot 
cross a specified subject, but (A3.1) shows that the crossing is 
irrelevant. Moreover, (A3.1) which is intuitively felt to be a 
subcase of whatever constraint is operative in (A3. 2) and XA3.3) 
cannot be handled by (AS.^). I suggest that a more observationally 
adequate statement wpuld be (A3. 5). 

(A3. 5) In & S's belonging to the category NP, (3.2iii) 
becomes increasingly inapplicable in proportion 
to the specificity of the Nucleus. 

As shown by (A3.2) , the increasing inapplicability of (3.2iii) 
concerns not just chopping, but feature-changing as well (similar 
facts hold for the distribution of any , as shown in Ross (1967)). 
In fact, (A3. 5) affects the same rules as those affected by (3.2iv); 
as shown in (A3. 6), (A3. 5) affects deletion under obligatory 
coreferentiedity, but not non-nuU pronominalization. Copying rules 
are not affected either (as illustrated in (A3.7) and (A3. 8) with 
Left Dislocation and WH-Q-Copying respectively, but further research 
is necessary, as I have not been able to check the acceptability of 
(A3. 8) with enough informants who question by copying). 

(A3. 6) Mary is too ugly for John to want to own 
ra picture of 1 
1 this picture of?? \ 
, Ed's picture of* 
i^Bill's son's picture of*^ 

It should be noticed that the entire set in (A3. 6) becomes acceptable 
if her is added at the end of each sentence. 



John's father's) 



(A3. 7) My father, I think I mentioned V this 

/ Bill's 

picture of him. 

(A3. 8) Who did I shov you I that ^ picture 

I Bill's 

Mary's mother's 



of him? 

The above facts suggest that (A3. 5) should be replaced by 
something like (A3.9). 
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(A3. 9) The phenomena which cannot affect elements of a 
Satellite, as predicted by (3.2iv)., are 
prevented from affecting the entire Satellite, 
(a) if the Nucleus is an NP, and (b) in 
proportion to the specificity of the Nucleus. 

If (A3. 9) is correct, then it must be teJcen into account when 
explanations for (3.2iv) are considered, for it is suggeflted rather 
strongly that the fact that the same restrictions hold for elements 
of Satellites and for whole Satellites under conditions of specificity 
is more than a mere coincidence. 

Finally, notice that (A3. 9) explains the often-noted fact that 
many speakers prefer to extrapose relative clauses rather than 
sentences in apposition to NP*s like the fact . Indeed, the determiner 
of the Nucleus in the latter case can often not be less specific than 
the , in view of the ill-formedness of fact that John is crazy is 
distiirbing (see section 3.2.3), and Nuclei having the as determiner 
tend to inhibit extraposition, regardless of the kind of clause to be 
extraposed. 



APPENDIX FOUR 



In this .Appendix, I consider a possible explanation of (3.110) 
in terms of (3.2) • I4y suggestion is that (3.110) is in fact 
unnecessary, since whatever purpose it purports to serve is 
essentially served by (3.2iv). Thus, (3.2iv) only precludes the 
migration of an element of a Satellite from under the dominance of 
the K & S node, but does not preclude movements within the domination 
of the W & S node; for example, relative pronouns can be attracted 
to their antecedent. A number of non-chopping phenomena which are 
subject to (3.2iv) create no unacceptabilities when they involve an 
element internal to the Nucleus and another internal to the Satellite; 
this is illustrated with ever -neutralization, deletion under obligatory 
coreferentiality, and comparative neutralization in (a1*.1)-(aU,3) 
respectively. 

(aU.I) I know no one who ever shot two lions, with one 
shot. 

(aU*2) The man (who) you mentioned has left. 

(aU.3) John loves Mary more than he hates Jill. 

To explain (3.110) we need only to assume that, when the Nucleus and 
the Satellite of an N & S contain an identical element in 'similar' 
positions, the element in the Satellite is first moved into the 
Nucleus where it ^s-« ' gobbled • by the identical element; the latter 
is subsequently moved out of the N i S. Thus, I am positing 
sequential, rather than simultaneous, movements in across-the-board 
processes. Gobbling rul'es of the type suggested above have been 
proposed in various places in the literature; the syntactic 
JustificationjDf my proposal is that it avoids the embarrassment 
of regarding across-the-board movements as exc*»ptions to (3.2iv). 
This account makes sense especially in behavioral terms; indeed, 
if the reconstruction of discontinuities created by across-the- 
board movements proceeds as in (aU.5), a perceptual conflict will 
arise (see section U.3), but if it proceeds as in (AU.Ua) or (AU.Ub), 
no "difficulty will arise. 

A minor apparent theoretical difficulty concerns the seemingly 
arbitrary choice between (AU.Ua) or (AU.Ub).in symmetric coordinations. 
The real-time procedure is probably (AU.Ua), since the first term 
carries no mark of subordination, and there consequently aeems to 
be no reason for the reconstructive procedure to hold the first term 
unresolved until the second is encountered. Similarly, in n-term 
coordinations, reconstruction very probably proceeds from left to 
right. 
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Uh.k) a. 




(AU.5) 



N & S 




An alternative way of dealing with across-the-board maiming 
while preserving the CSC is implicitly proposed in Neubauer (1970). 
Weubauer's analysis is interesting to consider because it makea the 
claim that copying rules are, exempt from the constraints which affect 
chopping rules ^ even when copying is followed by the deletion of the 
o riginal s so that the end product is indistinguishable from chopping. 

Neubauei! '.s, argument hinges on the fact that Conjunction- 
Reduction can remove elements from coordinations , tinlike Question- 
Movement^ Relativization^ Pseudo-Clef ting, etc., and argues that this 
is so because Conjunction-Reduction is a copying and deletion, rather 
than a chopping, inxle. However, Conjtinction-Reduction is the only 
rule considered by Neubauer which operates across-the-board. Since 
the various chopping rules are also free to operate across-the-board, 
and since it would be exceedingly ad hoc to regard them as copying 
and deletion rxiles when operating across-the-board and as chopping 
ones otherwise, the inevitable conclusion is that chopping rules do 
not apply across-the-board to coordinate terms, but rather to the 
output of some copying and deletion rules. The only copying and 
deletion rule which applies to coordinations in English i.s Conjunction- 
Reduction, and Neubauer 's position implicitly claims that (AU.Cc) is 
derived from (AU.6a) via (AU.6b) rather than directly . 



ikh.C) a. John likes potatpes and Mary hates potatoen. 
b. John likes, and ftary hates, potatoes, 
c* It's potatoes that John likes and Mary hates. 



I 



- 221 - 

Such an analysis must be rejected for several reasons: ^ 

(a) The pauses indicated by commas in (AU.6b) are obligatory, 
while no such requirement holds for (aU.6c). Conceivably, one might 
claim that pause insertion is a later rule than Cleft ing, and that 
the former somehow depends on the moved element being to the right 
of its place of departure. But such a claim would dissolve in the 
face of (AU.7) in which the pauses are perfectly optional, although 
the moved element is to the right of its places of departure in 
surface structure. 

(AU.7) What John likes and Mary hates is potatoes. 

(b) Conjunction-Reduction can only touch initial and final 
elements of coordinate terms, but chopping rules can touch medial 
elements as well. Therefore, (AU.8a) would have no source, since 
(AU.8b) is ungrammatical and (AU.8c), while grammatical, arises 
through Topicalization rather than through Conjunction-Reduction. 

(AU.B) a. 'It's John who I saw in Paris yesterday and 

you saw in London the day before yesterday. 

b. *I saw in Paris yesterday, and you saw in 

London the day before yesterday, John. 

c. John(,) I saw in Paris yesterday and you 

saw in London the day before, yesteirday. 

(c) As noted above, across-the-board chopping is possible, with 
various degrees of acceptability, in non-coordinate structures.^ 
However, Conjunction-Reduction is completely out in such cases, as 
shown in (AU.9)~(AU.11) . 

(AU.9) a. It's Mary who John likes more than Bill hates, 
b. *John likes, more than Bill hates, Mary. 

(aU.10) a. It's John Mary told that she loves • 
b. *Mary told» that she loves, John. 

(AU.11) a. It's John who the undisputable effort to save 
suggests a hidden intention to destroy, 
b. *The undisputable effort to save, suggests a 
hidden intention to destroy, John. 

The only way of upholding Neubauer's position that I can see is 
to pocit a copying and deletion rule, call it Mysterious-Reduction, 
which would derive (AU.6b) from (AU.6a) without requiring pause 
insertion, which would also derive structures like the b-sentences 
in (AU.8)-(AU.11) , and which would moreover be obligatorily followed 
by some chopping rule applying to the constituent it has copied. I 
assume that the artificiality of such a proposal is sufficiently, 
obvious for the matter not to be pursued any further. 
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